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Abstract 



We prove comparison results for the Swendsen-Wang (SW) dynamics, the heat-bath 
(HB) dynamics for the Potts model and the single-bond (SB) dynamics for the random- 
cluster model on arbitrary graphs. In particular, we prove that rapid mixing of HB 
implies rapid mixing of SW on graphs with bounded maximum degree and that rapid 
mixing of SW and rapid mixing of SB are equivalent. Additionally, the spectral gap of 
SW and SB on planar graphs is bounded from above and from below by the spectral gap 
of these dynamics on the corresponding dual graph with suitably changed temperature. 

As a consequence we obtain rapid mixing of the Swendsen-Wang dynamics for the 
Potts model on the two-dimensional square lattice at all non-critical temperatures as 
well as rapid mixing for the two-dimensional Ising model at all temperatures. This 
gives the first proof of rapid mixing of a Markov chain for the two-dimensional Ising 
model, which is widely regarded as the most studied model from statistical physics, at 
all temperatures. Furthermore, we obtain new results for general graphs at high or low 
enough temperatures. 
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1 

Introduction and results 



We study the mixing properties of Markov chains for the q-state Potts model. The Potts 
model with q G N states at inverse temperature (3 > consists of the set Op = {1, ... , q} v 
of (not necessarily proper) colorings of a graph G = (V, E) together with the probability 
measure 

where E(a) denotes the set of edges of the graph G with equally colored endvertices 
and Z is the normalization constant. This model, especially in the case q = 2 where 
it is called the Ising model, attracted much interest over the last decades not only in 
Statistical Physics but also in several branches of Mathematics and Computer Science. 
The goal is to sample from tt, at least approximately, and since exact sampling is in 
general not feasible, Markov chains are typically used. There are a couple of (more or 
less efficient) Markov chains to attack this problem and, usually, it is an easy task to show 
that their distributions converge in the long-time limit towards the right distribution. 
However, it is much more delicate to prove bounds on the mixing time, i.e. the number 
of steps a Markov chain has to run in order that its distribution is "close" to its limit 
distribution. In the following, a chain is said to be rapidly mixing for a family of graphs 
if, for each graph of this family, the Markov chain can be defined analogously on it and 
its mixing time is bounded by a polynomial in the number of vertices of the graph. 

We will consider two Markov chains for the Potts model, namely the heat-bath dynam- 
ics and the Swendsen-Wang dynamics. The heat-bath dynamics is the most common 
chain for this purpose. It is a local Markov chain that can be described as follows. Sup- 
pose that the current configuration is o = (cr(v)) veV G f2 P . In each step, a vertex v G V 
of the underlying graph is chosen uniformly at random and a new color is assigned to 
v with respect to the conditional probability given that the color of all other vertices is 
fixed, so that the new configuration r satisfies t(u) = cr(u) for all u ^ v. The heat-bath 
dynamics is proven to be rapidly mixing in several instances, see Section 2.5, but is typ- 
ically slowly (i.e. not rapidly) mixing if the inverse temperature /3 is large. For example, 
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if the underlying graph is the two-dimensional square lattice 1? L of side length L, it is 
known that there exists a critical inverse temperature /3 c (q), such that the heat-bath 
dynamics is rapidly mixing if /3 < /3 c (q) and slowly mixing if /3 > /3 c (q) (the latter seems 
to be proven only for q = 2 but it is at least expected to be true for all q > 2). See 
Section 2.5 for more known results and the specific bounds. 

The second Markov chain under consideration is the Swendsen-Wang dynamics that 
changes the color of a large portion of the vertices in each step. One step of this 
chain, given the current state a G Qp, can be described by the following two-step 
procedure. First, generate a subset A C E(p) of the edges of the graph with equally 
colored endvertices, such that every edge of E(a) is included in A with probability 
1 — e~ 13 . In the second step, assign independently and uniformly at random a color 
from {1, . . . , q} to each connected component of the subgraph (V, A). This gives a new 
Potts configuration. It is widely believed that this Markov chain is rapidly mixing in 
much more cases than the heat-bath dynamics, e.g. at low temperatures (large (3), and is 
therefore the preferred algorithm in practice. Nevertheless, results in this direction are 
rare. Besides some rapid mixing results concerning special classes of graphs like trees, 
cycles and the complete graph, or results for sufficiently high or low temperatures, there 
is no result that shows that it is generally advisable to use Swendsen-Wang instead of 
heat-bath dynamics. 

Our first main result, see Chapter 3, shows that rapid mixing of heat-bath dynamics 
implies rapid mixing of Swendsen-Wang, if the underlying graph has bounded maximum 
degree, which partially confirms the above-mentioned intuition. 

In order to give precise statements we need some notation. We define the spectral gap 
of a Markov chain with transition matrix P as 1 minus the second largest eigenvalue 
of P in absolute value and we denote it by A(P). It is well-known that polynomial 
(in \V\) bounds on the mixing time are equivalent to polynomial bounds on the inverse 
spectral gap, see Lemma 2.3, and since the spectral gap seems to be more convenient 
for comparison results, this is the quantity we are interested in. (Note that we classify 
Markov chains by their transition matrices, since the quantities of interest do not depend 
on anything else.) 

For the following comparison results we consider the two above mentioned Markov 
chains for the Potts model, as defined in Section 2.4, and we use this notation throughout 
this thesis for their transition matrices: 

• Phb for the heat-bath dynamics for the Potts model, see (2.18), 

• Psw for the Swendsen-Wang dynamics for the Potts model, see (2.21). 

The first result that we want to present here is a comparison between heat-bath and 
Swendsen-Wang dynamics for the Potts model. 
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Theorem 3.1. Suppose that Psw (resp. Pub) is the transition matrix of the Swendsen- 
Wang (resp. heat-bath) dynamics for the q-state Potts model at inverse temperature (3 
on a graph G with maximum degree A. Then 

A(Psw) > c sw A(P HB ), 

where 

c sw := c sw (A,/3,g) = q' 1 {qe 2 ^ . (1.1) 

This result implies rapid mixing of the Swendsen-Wang dynamics in some new in- 
stances. But, since the heat-bath dynamics is typically slow at low temperatures, it is 
not helpful for large values of 0. For this reason we switch to dynamics for a closely 
related model on the edges of the underlying graph, i.e. the random- cluster model, where 
it was possible to deduce some lower bounds on the spectral gap at low temperatures 
from lower bounds at high temperatures. The random-cluster model consists of the set 
of subsets of the edges Qrc = {A : A C E} and the probability measure 

^ = Ki^r^ ACE ' 

where c(A) denotes the number of connected components of the subgraph (V, A) and 
p := 1 — e~^. In fact, the Swendsen-Wang dynamics is based on the tight connection 
of Potts and random-cluster models. That is, there exists a clever coupling of n and /i, 
say u, such that the conditional probabilities, given either a Potts or a random-cluster 
configuration, are equal to the probability distributions that are involved in the first and 
the second step of the Swendsen-Wang dynamics, respectively, see Section 2.3. 

Using this construction it is obvious that the Swendsen-Wang dynamics (with its 
two steps in reversed order) defines also a Markov chain for the random-cluster model. 
Additionally, we define the single-bond dynamics which is a local Markov chain for the 
random-cluster model that chooses an edge e G E of the graph uniformly at random 
and includes or deletes e from the current random-cluster configuration with a certain 
probability, see (2.27). The transition matrices of these two dynamics are denoted as 
follows: 

• Psw for the Swendsen-Wang dynamics for the RC model, see (2.22), 

• Psb for the single-bond dynamics for the RC model, see (2.27), 

where RC model means random-cluster model. As an easy corollary of the construction 
of the Swendsen-Wang dynamics we obtain A(Psw) = A(Psw)? see Lemma 2.6. Thus, 
every result onP sw immediately yields a result on Psw- 
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It turns out that Swendsen-Wang and single-bond dynamics can be represented on the 
joint Potts/random-cluster model (i.e. the model corresponding to the coupling v) using 
the same "building blocks", which leads to the second main result of this thesis. We 
prove that rapid mixing of Swendsen-Wang is equivalent to rapid mixing of single-bond 
dynamics. 

Theorem 4.8. Let Psw (resp. Psb) be the transition matrix of the Swendsen-Wang 
(resp. single-bond) dynamics for the random- cluster model on a graph with m>3 edges. 
Then 

A(Psb) < A(Pgw) < 8mlogm A(P SB ). 



Finally, we restrict ourself in Chapter 5 to a special class of graphs, namely planar 
graphs, and, using the notion of dual graphs, it will be possible to relate the mixing 
properties of the Swendsen-Wang dynamics on the original graph to the mixing prop- 
erties on its dual with a suitable change of the temperature parameter. This is done 
by proving such a result for the single-bond dynamics and, using the comparison result 
above, translating it to the Swendsen-Wang dynamics. 

For this, assume that the underlying graph is planar, i.e. can be drawn in the plane 
without intersecting edges. Furthermore, given a random-cluster model on a planar 
graph G with parameters p and q, we call the random-cluster model on the dual graph 
with parameters p* = p ^^ p ^ and q the dual model. 

Theorem 5.6. Let P sw (resp. Psb) be the transition matrix of the Swendsen-Wang 
(resp. single-bond) dynamics for the random- cluster model on a planar graph G with m 
edges and let Pg W (resp. P$ B ) be the SW (resp. SB) dynamics for the dual model. Then 

A(Psb) < gA(P s f B ) 

and 

A(Psw) < 8qmlogm\(Pl w ). 



Since the dual model of the dual model is the primal random-cluster model (if G is 
connected), we obtain the bound in the other direction by applying Theorem 5.6 twice. 
This theorem allows us to obtain some new results on rapid mixing at low temperatures 
directly from known results at high temperatures. 

Now we turn to applications of the theorems above. But first, note that we do not give 
a direct analysis of the Markov chains under consideration in this thesis. Thus, all results 
of rapid mixing in specific settings rely ultimately on already known mixing results. In 
fact, the three results given below are based on lower bounds on the spectral gap of the 
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heat-bath dynamics for the Potts model. We refer to Section 2.5 for a collection of all 
previously known results that are necessary for the analysis of this thesis. 

The first application, which was the main reason for our study, deals with the two- 
dimensional square lattice 1? L of side length L. This is the graph 1? L = (Vl,2, El,2) with 
vertex set Vl,2 = {l,---,-^} 2 C 1? and edge set El,2 = {{u,v} C Vl,2 '■ \u — v\ = l}, 
where | • | denotes the Euclidean norm. 

Theorem 5.9. Let Psw be the transition matrix of the Swends -en-Wang dynamics for 
the q-state Potts model on 7? L at inverse temperature (3. Let n = L? . Then there exist 
constants cp = cp(q), d > and C < oo such that 

• A(Pgw) > ^ for(3<p c (q), 

• A(Pgw) > for(3>(3 c (q), 

n l log n 

• A(Pgw) > c'n~ c for q = 2 and = f3 c (2), 
where (3 c (q) = log(l + y/q). 

This result shows rapid mixing of the Swendsen-Wang dynamics for the Potts model 
on the two-dimensional square lattice Z| at all non-critical temperatures, i.e. at all 
P 7^ 0c(q), as we h as rapid mixing at all temperatures in the case q = 2. Note that it 
was not even known that the SW dynamics mixes rapidly for /3 < (3 C , as it is known for 
the heat-bath dynamics. Additionally, no Markov chain was proven so far to be rapidly 
mixing for all j3 > j3 c . 

As a byproduct we obtain the following result for the single-bond dynamics for the 
random-cluster model. 

Theorem 5.10. Let Psb be the transition matrix of the single-bond dynamics for the 
RC model on Z| with parameters p and q. Let m = 2L(L — 1) = |Pl,2|- Then there 
exist constants c p = c p (q), d > and C < oo such that 

~ c 

• A(P SB ) > — jt 5 forp^p c {q), 

mr log m 

• A(Psb) > c'm~ for q = 2 and p = p c {2), 
where p c (q) = j^=. 

For the third application we consider the more general family of planar graphs of 
bounded maximum degree. 
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Corollary 5.7. The Swendsen-Wang dynamics for the random- cluster model with pa- 
rameters p and q on a planar, simple and connected graph G with m edges and maximum 
degree A > 6 satisfies 

~ c(l - e) <=■ 
A(Psw) > J if V < 



and 

A(Psw) > 4^^, ifp>l 



m 2 log m ' ' ' gAt ' 

for some c = c(A,p, q) > and e > 0, where A^ is the maximum degree of a dual graph 
ofG. 



Additionally, we present a similar result for arbitrary graphs of bounded maximum 
degree. In this case the Swendsen-Wang dynamics is proven to be rapidly mixing if 
p < e/A, see Corollary 3.3 and Lemma 2.6. These results enlarge the previously known 
set of temperatures where rapid mixing is known, see [34] or Theorem 2.18, but lead to 
a worse bound on the spectral gap. 
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In this chapter we provide the necessary definitions and notations for this thesis. The 
experienced reader could skip this chapter and revisit it when necessary. 

First we give an elementary introduction to Markov chains on finite state spaces. See 
Levin, Peres and Wilmer [37] for more details. Then we define the measure of efficiency of 
Markov chains which we are concerned with, the spectral gap, and explain the relation of 
this quantity to the rate of convergence of a Markov chain to its stationary distribution, 
as well as the relation to another quantity, the mixing time. In the subsequent sections we 
focus on Markov chains for Potts and random-cluster models, and give precise definitions 
of the models and the used Markov chains. Finally, in Section 2.5, we state several known 
results on their mixing properties. 

2.1 Markov chains 

Let Q be a finite set, P be a Q x fi-matrix with P(x, y) > and XLefi P(. x -> z) = 1 for all 
x,y G Q, and X = (X t )tes, X t G Q, be a sequence of f2-valued random variables defined 
on a probability space (X, J 7 , P). We call X a Markov chain on Q with transition matrix 
P if for alH > 1 and all Xq, ...,ijGfl, such that P(X — Xq, . . . , X t -\ = x t -i) > 0, we 
have 

F(X t =x t \X = x ,..., X t _! = x t - x ) = P (X t = x t | X t _! = x t - x ) 

= P(xt-i,X t ). 

This is called the Markov property. Note that this implies that, for all x,y G Q and 
s G N with F(X S = x )>0, 

P\x,y) = ^P*- 1 (x,^)P(^,y) = P (X s+t = y\X s = x) for all t > 0, 

which defines the t-step transition probabilities. It is clear that the Markov property 
implies that the knowledge of P and of the distribution of X = (X 4 ) 4gN at any time, 
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say 0, determines the distribution of the Markov chain at all future times: Suppose that 
the distribution of the Markov chain at time is given by some probability mass function 
7/0, i.e. i]q(x) = P(X = x) for all x G Q, then the distribution of the Markov chain at 
time t > 1 satisfies 

¥{X t = x) = Y,^)P\z,x). (2.1) 

Therefore, we can simulate one realization of the Markov chain X on Q with transition 
matrix P and initial distribution rj by generating first a state Xq with respect to the 
distribution r/ , and then generating successively the state x t with respect to P(x t ~i, ■), 
t > 1. Especially, if the initial distribution of the Markov chain is concentrated at a 
single state, i.e. n (x) = l(x = Xq) for some x G the distributions of the Markov 
chain take the simple form F(X t = x) = P t (xo,x). Here, the indicator function 1 is 
defined by \{x = x ) = 1 if x = x and l(x = x ) = otherwise. In general, 1 equals 1 
if the statement in the parentheses is true and equals otherwise. From now on we will 
identify probability mass functions with row-vectors, such that equality (2.1) simplifies 
to F(X t = x) = r/QP t (x) and for a probability mass function tt we define, for 4c0, the 
probability measure tt by 

ir(A) := ^tt(x). 

(By convention, we will use tt interchangeably as a measure and a mass function.) 
Markov chains are typically used to sample (approximately) from distributions for which 
direct simulation is not feasible. For this it is necessary that, at least in the limit, the 
distribution of the Markov chain reaches this distribution, say tt, i.e. 

tt(x) = lim F(X t = x) for all (2.2) 

t— >oo 

It is in general not guaranteed that such limits exist, but we can present a sufficient condi- 
tion for existence. We call a Markov chain irreducible if for all x, y G £1, Bt : P f (x, y) > 0, 
and aperiodic if for all x G Q: gcd {t > 1 : P t (x, x) > 0} = 1, where gcd{ J} denotes the 
greatest common divisor of all elements in J C N. Clearly, for an irreducible Markov 
chain, P(x, x) > for any x G £1 is a sufficient condition for aperiodicity. All Markov 
chains of this thesis will be irreducible and aperiodic, and we use ergodic as an abbrevi- 
ation for irreducible and aperiodic. It is well-known that the above limit (2.2) exists for 
aperiodic Markov chains X and, if the Markov chain is also irreducible, the limit distri- 
bution tt is independent from the initial distribution 7/ and satisfies n(x) > 0, Va; G Q. 
Thus, ttP = tt, which follows obviously from (2.2), is equivalent to (2.2) whenever X 
is an ergodic Markov chain with transition matrix P. We call a distribution tt with 
ttP = tt a stationary distribution of the Markov chain with transition matrix P. 

Another important concept is the reversibility of Markov chains. A Markov chain 
with transition matrix P is reversible (or satisfies detailed balance) with respect to tt if 
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for all x,y £ Q 

7t(x)P(x,y) = n(y)P(y,x). 

It is not hard to prove that, under this condition, it is a stationary distribution of P. 
Finally, we call a Markov chain lazy, if its transition matrix P satisfies P{x,x) > ~, 
Vx G Vt. Obviously, lazy Markov chains are aperiodic. If X is a Markov chain with 
transition matrix P, and X is a Markov chain with transition matrix Q, such that 
Q(x, y) = \P{x, y) for all x ^ y, x,y G Q, and f(X = x) = F(X = i), Vx 6 fl, then 
we say that X is the lazy version of X. Necessarily, Q(x, x) — | + \P(x, x) for all x G Q. 

Throughout this thesis we will denote properties of a Markov chain with transition 
matrix P as properties of P, unless they depend on more than P. For example we say 
for a ergodic Markov chain with transition matrix P that is reversible with respect to 
7T, that P is ergodic and reversible with respect to ir. 



2.2 Spectral gap and mixing time 

In the following we want to estimate the efficiency of Markov chains for approximate 
sampling from their stationary distribution. For more details and aspects of the conver- 
gence of Markov chains to their stationary distribution see Levin, Peres and Wilmer [37]. 

To quantify this efficiency we first define the total variation distance of two distribu- 
tions v and 7r on Q by 

II^-tIItv := g^lK^) -*( x )\ = max|z/(A) -tt(A)|. 

See [37, Prop. 4.2] for the second equality. Using this as a metric on the set of all 
probability measures on Q it is natural to ask how fast the convergence in (2.2) takes 
place, i.e. how fast the distribution of a Markov chain for increasing time t converges 
to its stationary distribution. The next statement, which is called the Convergence 
Theorem, provides a more quantitative version of (2.2), see e.g. [37, Theorem 4.9]. 

Theorem 2.1. Let P be ergodic with stationary distribution ir. Then there exist con- 
stants a G [0, 1) and C > such that 

maxllPHx, •) - ttIL, < Ca\ (2.3) 

This theorem shows that, for every x G Q, the distribution of a Markov chain with 
transition matrix P and initial distribution concentrated at x converges exponentially 
fast (in total variation) to its stationary distribution. In fact, the worst initial distribu- 
tion, i.e. the distribution n that maximizes ||?7oP*(-) — vr|| TV , is concentrated at a single 
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state. To see this, note that for a distribution r) on Q 



Ti 



TV 



< ^ % (x)||p*(x,-; 



7T 



TV 



TV 



7T < max P 4 (x, •) - 7T 

MTV xen II IITV 



Thus, it is enough to consider the maximum as in Theorem 2.1 to get a statement for 
arbitrary initial distributions. 

For given e > 0, we are interested in the minimal time t such that the total variation 
distance between the distribution of our Markov chain at time t and its stationary 
distribution is at most e, independent from the initial distribution. This t is called the 
e-mixing time and is defined by 



t mix (P, e) := min < t > : max IIP 



xeQ 



X, • - 7T 



TV 



< e 



Using the fact that t mix (P,e) < [logfe^ 1 )] t m ix(P, l/2e), see e.g. [37, p. 55], it is enough 
to consider the mixing time t mix (P) := t m i x (P, l/2e). Unless otherwise stated log denotes 
the natural logarithm. Obviously, bounds on i mix imply (and follow from) bounds on the 
optimal, i.e. smallest, constants C and a in Theorem 2.1. To be precise, if we know some 
constants C and a for which (2.3) holds, we obtain t m i x (-P) < log(2eC) ■ log(l/a) _1 . For 
the reverse direction note that 



max ||P*(x, 
xen 11 



TV 



Itv ^ e 



- [t/t n 



< e 



i-t/t D 



with t mix = t mix (P), see [37, eq. (4.34)]. 

In this thesis we are concerned with bounding the optimal constant a in (2.3) and, 
in particular, comparing the optimal constants for different Markov chains. We will see 
(Lemma 2.3) that this also implies bounds on the mixing time. 

For this we introduce the spectral gap of a Markov chain. Let P be the transition 
matrix of a Markov chain on Q that is ergodic and reversible with respect to 7T. We 
regard P as an operator that maps functions / : O — > R to functions by 



Pf{x) := ^P(i,y)/(y). 

yen 



(2.4) 



Such an operator is called the Markov operator that corresponds to P and we will use 
the same notation for the Markov operator and its corresponding transition matrix. 
Note that Pf(x) is the expectation of / with respect to the distribution P(x, •), i.e. 
the distribution of the Markov chain with transition matrix P and initial distribution 
concentrated at x G f2 after one step. If we consider a function / on Q as an element 
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of MP (i.e. a column vector), then Pf is simply matrix multiplication. Additionally, we 
endow the function space M. n with the inner product 

(/, g)« ■= n(x), f, g E R n , (2.5) 

and denote by L 2 {ti) the inner product space (or Hilbert space) that consists of M. Q with 
the inner product (•, •)„-. Especially, the norm in L 2 (7r) is given by 

12 - </,/>* = ^/(x) 2 vr(x), /G™ n 



Using the inner product (2.5) we define the adjoint operator P* of P as the (unique) 
operator that satisfies (f,Pg) n = (P*f,g) n for all /, g e L 2 (vr). The operator P* is also 
a Markov operator and the corresponding transition matrix is 

P*(x,y) = ^-P(y,x), x,yeQ. (2.6) 

Since P is reversible with respect to n, we obtain that P = P*. Hence, P defines a 
self-adjoint operator. This implies that P has only real eigenvalues, i.e. real numbers £ 
with Pf = £/ for some ^ / G see e.g. [35, Thm. 9.1-1]. By the ergodicity of P, 
these eigenvalues satisfy — 1 < & < 1 and, additionally, Pf = f if and only if / is 
constant, see [37, Lemma 12.1]. 

We define the (absolute) spectral gap of P by 

X(P) := 1 — max/ 1^| : £ is an eigenvalue of P, £ ^ l\. 

It is well-known that the spectral gap can be written in terms of norms of the Markov 
operator P. For this we define the spectral norm (or simply operator norm) of an 
operator P by 

\\ P \U '■= ll-Plkw-h^M = max \\Pf\\*. (2.7) 

ll/IU— i 

We use || • \\ n interchangeably for functions and operators, because it will be clear from 
the context which norm is used. For a self-adjoint P the operator norm \\P\\ n equals the 
largest eigenvalue of P in absolute value. To give a representation of the spectral gap 
we need also the operator S n that is defined by S^f = (f,l) n . This (Markov) operator 
corresponds to the (transition) matrix S n (x,y) = ir(y) for x, y 6 Q. Obviously, S n has 
only eigenvalues 1 and 0, and the eigenspace to eigenvalue is {/ G L 2 (tt) : (/, l) n = 0}, 
which is also the union of all eigenspaces of P for eigenvalues different to 1. Thus, 

X(P) = 1- ||P -5X . (2.8) 
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As stated above, the spectral gap and the speed of convergence in Theorem 2.1 are 
closely related. The next lemma, see [37, Coro. 12.6], demonstrates this relation. 

Lemma 2.2. Let P be ergodic and reversible with respect to it. Then 

( \ V * 
lim max \\PHx, ■) - it L, = 1 - A(P). 
<->oo V xen 11 iiTvy 

This shows that 1 — A(P) is the optimal constant a in (2.3). One may hope that this 
asymptotic equality holds, at least approximately, also non-asymptotically, i.e. that there 
exist constants c, C > such that 

c(l-X(P)Y < msx\\P\x,-) -tt|| tv < C(1-A(P))* 

for all i 6 N, and, in fact, this inequality holds with constants c = ~ [37, (12.13)] and 
= min xg n 7r (x) [37, (12.11)]. Plugging this into the definition of the mixing time 
we obtain that mixing time and spectral gap of a Markov chain (on finite state spaces) 
satisfy the following inequality, see e.g. [37, Theorem 12.3 & 12.4]. 

Lemma 2.3. Let P be the transition matrix of a reversible, ergodic Markov chain with 
state space Q and stationary distribution it. Then 

A(P)- 1 - 1 < t mh[ (P) < log ( — ) A(P)-\ 

\ 7f min / 

where 7r min := min xen tt{x) . 

We are interested in the spectral gap of specific Markov chains and, in particular, in the 
dependence on the size of the state space if the Markov chain can be defined analogously 
on an unbounded family of state spaces. Thus, if we consider an indexed family of state 
spaces {f2 n } ne N with a corresponding family of transition matrices {P n } ng N, we say that 
the Markov chain is rapidly mixing for the given family if A(P n ) _1 < clog(|f2 n |) for all 
iiGN and some c, C < oo. 

Remark 2.4. The main reason for sampling from a given probability distribution n 
on Q is probably the approximation of expectations S n f = (f, l) n for certain functions 
f G MP. This is frequently done e.g. in Statistical Physics to deepen the understanding 
of the underlying model. If exact sampling from tt is not feasible, then this often can be 
done by Markov Chain Monte Carlo methods that can be described as follows. Choose 
an initial state Xq G Q (deterministically or by some distribution), then simulate k steps 
of the Markov chain with transition matrix P to obtain a sequence x±, . . . ,Xk G f2 and, 
finally, take the average At{f) := Yli=o f( x i) ■ Usually it is better to omit several 
states at the beginning of the sequence to improve the performance, see e.g. Rudolf [60]. 
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The number of these omitted states is called burn-in in the literature. We know from the 
Ergodic Theorem [37, Thm. that lim^jx, A^(f) = S w f almost surely, whenever P 

is irreducible and has stationary distribution it. In this context the spectral gap plays an 
important role to bound the number k of steps of the Markov chain that are necessary to 
achieve a prescribed error e > 0. For a bound on k for the probabilistic error criterion 
that depends on the spectral gap and the e-mixing time see Levin et al. [37, Sec. 12.6]. 
Another bound that deals with the mean square error and depends only on the spectral 
gap is given in [58, 59], see also Novak and Wozniakowski [53] for some context and 
results in a more general setting. Overall, one can say that the existence of a rapidly 
mixing Markov chain for a family of state spaces {f2 n } ng N with corresponding measures 
{7r n } ng N leads to an efficient algorithm for the approximation of expectations of functions 
(with respect to 7i n ) defined on Q n , i.e. an algorithm that needs time proportional to s~ 2 
times a polynomial in log(|f2 n |) (times the variance of the considered function). 



To finish this section we present a simple technique to compare the spectral gaps 
of two Markov chains on the same state space, but with possibly different stationary 
distributions. This result is well-known, see e.g. [15] or [37], but since it is used several 
times in this thesis we present its proof here. See also Dyer, Goldberg, Jerrum and 
Martin [21] for a survey on other (and more general) techniques for comparison of Markov 
chains. 

Lemma 2.5. Suppose Pi (resp. P 2 ) is an ergodic and reversible transition matrix with 
stationary distribution 7i"i (resp. n 2 ) on Q. If there exist constants a, A > such that 

7r 1 (x)P 1 (x,y) tti(x) 

for all x,y 6 £1, then 

A(Pi) < -A(P 2 ). 
a 

If P 2 has only non-negative eigenvalues it is enough to verify the conditions for x ^ y. 



Proof. For / G Mr define 

S 1 (f) := ((/ - P,)f, f) ni and T x (f) := ((/ + P,)f, f) n 

with the identity If := /. Equivalently we define S 2 and Ti for P 2 . By reversibility and 
ergodicity, P\ has only real eigenvalues 

1 = 6(Pi)>6(Pi)>--->^|(Pi)>-1, 
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i.e. A(Pi) = min{l — &(Pi), 1 + £|n|(Pi)}- Thus, using the "min-max characterization" 
of the eigenvalues [33, Thm. 4.2.2 & 4.2.11] and that £i(Pi) corresponds to the constant 
eigenfunction, we obtain 

1 — ?2(" ij = 1 — max = mm — = mm 



, ff°-- (/,/)** ff* CAAn , Var wi (/) 

with Var^(^) = (g,g) Kl - 1)^ for <? G R n . Noting that £(/) = £(/ + c) and 
Var,^/) = Vaxmif + c) for every c G R, it follows 

l-&(Pi) = min ct ' (ij 



Vax wi (/)' 

Var^ (/)^0 

where Var,,.^/) ^ iff / is not constant. Equivalently, 

! , e fr3 \ i . ■ (Pf,f)m ■ Fi{f) 

1 + qn|("i) = 1 + mm , „ „, = mm 



/*> (/, /) Wl /A) (/, /) 
It is easy to check (using reversibility) that 



2 



and 

^(/) = 2 E (/(^) + /(z/))V(x)P 1 (x, 2/ ). 

It follows from the first assumption of this lemma that £i(/) < A 82(f) and J~x(f) < 
A^if) for every / G R n . The second assumption implies (f,f) ni > a (f,f)n 2 an d 
hence that 1 + 6|n|(Pi) < ~(1 + £|n|(-Pa))- Additionally, for / G R n with (/, 1)^ = 0, we 
obtain 

Var^/) = (/, />„ - (/, < < 1 (/,/>„ = -Var^/), 

which implies Var,,.^/) > aVar 7r2 (/) for every / G R n and thus that 1 — ^(Pi) < 
~(1 — ^2(^2))- Note that if P2 has only non-negative eigenvalues it is enough to compare 
E\ and £2, since A(p2) = 1 — £2^2)- Both do not depend on the diagonal elements of Pi 
and P2. This proves the claim. 

□ 
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2.3 The models 

In this section we introduce the models that we study in this thesis. Although we are 
mainly interested in sampling from the Potts model, we additionally need the closely 
related random-cluster and Fortuin-Kasteleyn-Edwards-Sokal (FKES) models. Since 
all these models are defined on an underlying graph, we begin with providing some 
general graph terminology, see e.g. Diestel [17] or Mohar and Thomassen [52] for a more 
comprehensive introduction to graph theory. 

A graph G is a pair, V and E, where V is the finite set of vertices and E is a set 
of edges, together with a function ip that assigns to each edge e G E a set of at most 
two vertices, i.e. (p(e) = {u,v} for some u, v G V, which are called its endvertices. We 
denote the endvertices of an edge e G E by and e^ 2 \ i.e. ip(e) = {eW,e^}. Let 
(f(E) = {<p(e) : e G E}. We say that u and v are neighbors in G, if {u, v} G (p(E). 
Furthermore, u and v are called connected, if there exist vertices v ,...,v n G V such 
that Vq = u, v n = v and {fj_i,t>j} G <f(E), i = l,...,n. We write u-^-v if u and 
v are connected (in G). Now suppose we have two graphs, G = (Vg, Eg, fc) and 
H = (Vh, Eh,<Ph)- We say that H is a subgraph of G if Vjj C Vg, Eh C -Eg and 
= ^(e) f° r all e G E7#. If additionally = Vq we say if is a spanning subgraph 
of G. Let A C -E be a subset of the edges of the graph G = (V,E,<p). The graph 
Ga '■= (V,A, (p) is then a spanning subgraph of G and we write uAv if u,v e V are 
connected in Ga, i-e. there exist vertices v , . . . , v n G V such that v — u, v n — v and 
G <p(A), % = l,...,n. Clearly, o defines an equivalence relation on V for 
every A C E and its equivalence classes are called connected components of Ga- Two 
distinct edges ei,e2 G E that have the same endvertices, i.e. they satisfy <f(ei) = y?^), 
are said to be parallel and edges with equal endvertices, i.e. e G E with e^) = (or 
\<p{e) \ = 1), are called loops. 

Most graphs of this thesis are simple graphs, i.e. graphs that contain no parallel edges. 
(Some authors use the term graph for simple graphs and graphs, as defined above, are 
called multigraphs.) The advantage of simple graphs is that |<p(i£)| = \E\ and thus, that 
ip is a bijection between E and ip(E). In this case we identify (p(E) with E and write 
G = (V, E) for the graph (V, E, ip), as well as e = {u, v} for (p(e) = {u, v}. With a slight 
abuse of notation we omit <p also if the graph is not simple, i.e. we write G = (V, E) 
for (V,E,ip), and make the convention that {u,v} = e (resp. {u,v} G E) simply means 
{u,v} = <p(e) (resp. {u,v} G ip(E)). (Note that we will use this notation only in one 
direction, i.e. given e G E we obtain a unique {u, v } C V with {u, v} = e. The other 
way cannot be done uniquely if the graph contains parallel edges.) 

As example graphs one can have in mind the two-dimensional square lattice T? L of 
side length L, i.e. the graph Z 2 L = {Vl,2, El,2) with vertex set Vl,2 = {1, • • • , L} 2 C Z 2 
and edge set El^ = {{u,v} C Vl,i '■ \u — v\ = l}, where | ■ | denotes the £2 norm, see 
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Figure 2.1 (left), and a tree, which is a graph T = (V, E) with \E\ = \V\ — 1 and exactly 
one connected component, see Figure 2.1 (right). Obviously, deletion of any edge of T 
increases the number of connected components. 




Figure 2.1: The graph Zj? (left) and a tree (right) 

For the further analysis we need the following graph quantities. First of all, we will 
refer to the number of vertices of a graph, i.e. \V\ for G = (V,E), as the size of the 
graph. The degree of a vertex v in G — (V,E), i.e. \{e £ E : v G e} , is denoted by 
deg G (v ) and we use A(G) for the maximum degree of G, i.e. 

A(G) = max deg G (v). 

For example, A(Z|) = 4 for every L > 3. Additionally, for a graph G = (V,E) we 
denote by c(Ga) the number of connected components of the graph Ga = (V, A), A C i£. 
Thus, c(G J 4) is the number of equivalence classes in 1/ with respect to A. If the graph is 
fixed we simply write c(A) for c(G/). Note that, if T = (V, E) is a tree, c(T^) = \V\ — \A\ 
for all Ad E. 

Now we introduce the models under consideration. For this, fix a graph G = (V,E), 
a natural number q > 1 and a real number /3 > 0. Typically, (3 is called the inverse 
temperature. The q-state Potts model on G is defined as the set of possible configurations 
Q P := Q P (G) = [q\ v , where [q] := {1, . . . , q} is the set of colors (or spins), together with 
the probability measure 

a G ftp, (2.9) 
e} (2.10) 



where 

E{a) := (eEE: o~(u) = cr(v) for {u,v} 
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is the set of edges with equally colored endvertices. The normalization constant (also 
called partition function) Z is given by 

Z(G,P,q) := e " lE(a)l - 
<ren P (G) 

This measure is called the Potts measure (or Boltzmann distribution) and if q = 2 we 
call the Potts model the Ising model. The inverse temperature /3 determines the inter- 
action strength between neighboring vertices in the graph and since /3 is non-negative, 
configurations with more equally colored neighbors have a larger weight in it. Especially, 
at infinite temperature, i.e. — 0, the measure tt is the uniform distribution on Op and 
the larger j3 the more weight goes to the (almost) constant configurations. 

A closely related model is the random- cluster model (also known as the FK-model), 
that was introduced by Fortuin and Kasteleyn [24]. It is defined on the graph G = (V, E) 
by its state space RC = {A : A C E} and the random- cluster (RC) measure 

m - <m) - z^ot te) ' Al q " A ' ACE ' (2 ' u) 

where p G [0, 1], c(A) is the number of connected components in the graph (V,A) and 
Z(-, -, •) is the same normalization constant as for the Potts model [29, Thm. 1.10], see 
(2.13) and (2.14). For a historical treatment and related topics see Grimmett [29]. 

To describe the connection of Potts and random-cluster models we state a coupling of 
the corresponding measures that is due to Edwards and Sokal [23]. This coupling leads 
to the third model that will be considered, namely the Fortuin-Kasteleyn-Edwards-Sokal 
(FKES) model (or simply the joint model). For this let p = 1 — e~^, as we will assume 
henceforth. The joint model is defined on fij := Op x Org by the FKES measure 

"(*,A) := v®((T, A) = : (t^-}\ M t(AcE(a)) (2.12) 



Z(G,log( T ^),g) \l-p 

for (a, A) 6 Oj. Again, the normalization constant Z(-, ■, ■) is the same as for the Potts 
measure [29, Thm. 1.10]. To see this, note that 

A C E(a) V{u,v} G A : a(u) = a(v), 

which implies that a is constant on each connected component of (V,A). Hence, for a 
given A C E there are exactly q c ^ configurations a G Op with u(a, A) > 0. Using this, 
we obtain for a G Op and A C E 

/ \ \A\ / \ \A\ 

E 13^ 1(A C E(t)) = (j^—) |{r G P : Ac E(t)}\ 

" 7 V J (2.13) 

n \ |A| 

i-pj 



reflp 
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and, since p = 1 — e ^, 

(\ |B| i- "v,"j 



\ \ B \ |E(cr)l /I P/ M\ / 

p y \, n j-,, -sr-^ (\E((T)\\ ( p 



k J \1 — p 



\E(a) 

\-p, 



(2.14) 



1 + _P_] ' = e ^)l. 



Summing over A G Org in (2.13) and over a G Op in (2.14) proves that the normalization 
constants of ir, /i and v are equal. Another fact that can be deduced from (2.13) and 
(2.14) is that the marginal distributions of v are exactly n and //, respectively, see [23] 
or [29]. This means that for all a G Op and A C E, 



7r ( (T ) = ^2 u ( a ' A ) = v(&,Qrc) 
Ace 



and 

fj,(A) = u ( a i A ) = v{tt P ,A). 

We define the conditional probability of o G Op with respect to v given A C by 



t/(oyA) 



if (cr, A) G Oj, 



^(alA) := <^ ^ P ' A) ' V ' ' (2.15) 
I 0, otherwise. 

For v{A | a) replace u(Qp, A) by u(a, Org). We obtain 

£ /i(A)^l^) = vr(a) 

and 

= /i(A). 

This can be interpreted in the following way. Assume that we can simulate a random 
variable X G Org that is distributed with respect /x. The random variable F G Op, which 
is obtained by sampling from v(-\X) is then distributed with respect to ir. In fact, the 
conditional probabilities take a rather simple form, namely i/(cr|A) = q~ c( - A ^ l(Ac E(a)) 
and v(A | a) = p |A| (l -p)\ E ^)\-\ A \ t(A C E(p)). Thus, given a RC configuration X ~ /i, 
i.e. one that is distributed with respect to /i, we can generate a Potts configuration 
Y ~ 7r by assigning a random color from {1, . . . ,q} independently to each connected 
component of the graph (V, X). For the reverse way, given Y ~ tt, include all edges 
e = { e (1) , e (2) } G E with F(e (1) ) = F(e (2) ) to X with probability p. Hence, if Potts and 
RC model are defined on the same graph G and p = 1 — e~°, then the possibility of 
efficient sampling from either model enables also efficient sampling from the other. 

We will see in the following section that the considered non-local Markov chain, namely 
the Swendsen-Wang dynamics, is based on this connection of the Potts and random- 
cluster models. 
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2.4 The dynamics 

This section is devoted to the definition of the Markov chains that will be used to 
sample approximately from the Potts and the random-cluster model. In Chapter 3 
and Chapter 4 we will compare the spectral gaps of these Markov chains and, as a 
consequence, obtain new results on the mixing properties of the Swendsen-Wang and 
the single-bond dynamics. Recall that we have fixed a graph G = (V, E), some q G N, a 
real (3 > and p = 1 — e _/3 . We distinguish between dynamics for the Potts model and 
dynamics for the random-cluster model, and we will propose one local and one non-local 
Markov chain in both cases. Local means that the Markov chain changes the current 
state in one step only locally, i.e. let P (resp. P) be the transition matrix of a Markov 
chain on Vlp (resp. I^rc) with stationary distribution tt (resp. a), then we say that P 
(resp. P), or its corresponding Markov chain, is local, if \{v G V : o~(v) ^ t(v)}\ < 1 
for all a,T e Q P with P(a, r) > (resp. \A \ B U B \ A\ < 1 for all A,Be Q RC with 
P(A,B) > 0). Thus, local Markov chains update only one vertex (resp. edge) of the 
current configuration per step. 

Another notion of locality is that the transition probabilities, e.g. P(o~, r) for a, r G flp, 
depend only locally on the states a and r. This is called the finite range interaction 
property of the transition probabilities, see e.g. [44], and means (in the case of the 
Potts model) that for every a, r G Qp with P(a, r) > there exists a subset of the 
vertices Vq C V, with \Vo\ "not too large", such that P(a,r) = -P(c / , r') for a',r' G flp 
with o~(v) = o~'(v) and r(v) = t'(v) for all v G Vo. An analog formulation can be 
found for Markov chains for the RC model. This property is especially interesting 
for computational reasons, because it enables efficient computation of the transition 
probabilities and therefore efficient simulation of the Markov chain. From the local 
dynamics of this section only the one for the Potts model satisfies the finite range 
interaction property. 

2.4.1 Dynamics for the Potts model 

The heat- bath dynamics We begin with the definition of the local Markov chain 
for the Potts model. This Markov chain, namely the heat-bath dynamics (or Glauber 
dynamics), was introduced by Glauber [27] in 1963 for the Ising model and, since then, 
became the most studied Markov chain for the g-state Potts model (especially for q — 2). 
The one-step transitions can be described as follows. Suppose that the current state of 
the Markov chain is a G flp. Then: 

HB1) Choose a vertex v G V uniformly at random. 

HB2) Let Q a}V := {r G f2p : t(u) = o~(u),\/u ^ v} and choose the next state of 
the Markov chain, say r, with respect to 7r(- | £l av ), that is the conditional 
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probability (w.r.t. n) given that r differs from a only at v. 
For u G Qp, k G [q] and u,v & V, let 

i [ if m 7^ f , 

[/c, it U = V , 

be the configurations that differ from a at most at v G V and let 

dv,k(c) '■= \{e e E : a(u) = k for {u,v} = e}| 

be the number of edges in G connecting v to vertices with color k in a. It is easy to 
see that the set from (HB2) can be written as Q a ^ v = [J 9 i =1 {o' v ' 1 }, and that the 
conditional probabilities satisfy 

<° v ' I n ^ = E?=i = jx^^w (2 - 17) 

and 7r(r | = 0, whenever r 7^ o" 11 ^ for all k. Therefore we can write the transition 

matrix of the heat-bath dynamics on f2p as 

Phb(<i,t) := Pg BAq (<T,r) = -L ^ l(rgO ff , c ). (2.18) 

Clearly, P H b is reversible with respect to n and, as long as (3 < 00, also ergodic. 

An interesting feature of this simple construction of a Markov chain is that the (tem- 
poral) mixing properties of the heat-bath dynamics are proven to be equivalent to some 
spatial mixing properties of the associated Potts measure if the underlying graph is a 
rectangular subset of the (physically most relevant) d- dimensional integer lattice Z d , see 
e.g. [1, 22, 32, 45, 46, 47, 48, 63] or the survey of Martinelli [44]. Similar results are 
known for trees [51]. It turns out that certain spatial properties of the model imply 
tight bounds on the spectral gap (or mixing time) of the heat-bath dynamics. In Sec- 
tion 2.5 we will state some of these results, especially for the two-dimensional square 
lattice where we have an almost complete characterization of the mixing properties. 

One drawback of the heat-bath dynamics is that it is typically slowly (i.e. not rapidly) 
mixing at low temperatures (large (3), see e.g. [6], [7], [37] and [62]. We refer again to 
Section 2.5. In fact, at zero temperature (/3 = 00) the heat-bath dynamics is not even 
irreducible, although this is the simplest case: uniform distribution on q (constant) 
configurations. 

In the next paragraph we introduce the Swendsen-Wang dynamics, which overcomes 
the slow mixing behavior at low temperatures. This Markov chain is the primarily 
studied Markov chain of this thesis. 
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The Swendsen-Wang dynamics Inspired by the representation of the Potts mod- 
els of Fortuin and Kasteleyn [24], Swendsen and Wang [65] invented a Markov chain 
around 1987, that makes large steps through the state space and is surprisingly simple 
to implement. We call this Markov chain, which is presently the best candidate for the 
most efficient dynamics to sample from the Potts model on general families of graphs, 
Swendsen-Wang (SW) dynamics. 

As stated several times, this Markov chain is based on the connection of the random- 
cluster and Potts models that is given by the coupling of the corresponding measures of 
Edwards and Sokal [23]. 

First, recall that this coupling is given by the FKES measure (2.12), i.e. 

v{<t, A) = i (r^y ' t{AcE(a)), aen P ,Aen RC , 

with E(a) from (2.10) and some p G [0,1], and that the conditional probabilities 
(w.r.t. v) of a given A (or of A given a) satisfy 

v {a | A) = q- c ^ l(Ac E(a)) (2.19) 

and 

u(A\a) = p lAl (l-p) ma)HA] t(AcE(a)), (2.20) 

see (2.15) and the discussion thereafter. 

The Swendsen-Wang dynamics (for the Potts model) makes use of these conditional 
probabilities in such a way that, given the current state a G flp, the next state r G flp 
is generated by first sampling a A C E from u(-\a) and then, sampling r from u( ■ \A). 
Thus, the transition matrix of the Swendsen-Wang dynamics for the g-state Potts model 
on G at inverse temperature = — ln(l — p) is given by 

P sw (a, r) := Pjfa Aq (cr, r) = < A I °) < T I A ) 

Age 

/ \ 1^1 ( 2 -21) 

= (i - p)i^)i ( -rr~ ) ^ C{A) 1 ( A c E ( a ) n E ( T )) ■ 

ace ^ ^ ' 

Using the same interpretation of this sampling with respect to the conditional proba- 
bilities that was given in Section 2.3, we can describe one step of the Swendsen-Wang 
dynamics as the following two-step procedure: 

SW1) Given a Potts configuration a G Qp on G, delete each edge of E(a) indepen- 
dently with probability 1 — p = e' 13 . This gives A G Qrc- 

SW2) Assign a random color from [q] independently to each connected component 
of (V,A). Vertices of the same component get the same color. This gives 

t G Op. 
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Ergodicity of this Markov chain is easy to check. For this, imagine that the generated 
edge set A of the first step (SW1) is the empty set (this happens with probability 
(1 - P )\ E ^\ > for p < 1), then, in step (SW2), every possible configuration can be 
generated with probability q~^. Hence, Psw(c, T ) > for all a, r e fip, which implies 
ergodicity. If p = 1 (/? = oo), then the only elements of Qp with positive measure are the 
q constant configurations, for which we have Psw(o", t) — Q 1 > 0. For the reversibility 
of Psw with respect to tt note that 7r(er) = 4(1 —p)~^ E ^, since p = 1 — e _/3 . We obtain 
from (2.21) that 

vr(a) P SW (<7, r ) = i ^ (1 - g" c ^ 1 (A C P(<x) n £(r)) , 

AcE 

which is symmetric in a, r. This shows that the SW dynamics is reversible and ergodic 
for every G, q and /3, but, in contrary to the heat-bath dynamics, this Markov chain 
seems to be efficient also for large (3. For results that this is true for large enough (3 see 
e.g. [34] and [43]. Additionally, there are plenty of numerical results suggesting that the 
Swendsen-Wang dynamics is rapidly mixing at all temperatures in various cases. These 
include the Ising model and the 3-state Potts model on the two-dimensional square 
lattice [19, 61] and the Ising model on the three-dimensional cubic lattice [55]. 

However, rigorous proofs of such statements are rare. There far as we know, 

presently only four cases where rapid mixing at all temperatures is proven. These are 
on trees, cycles [11, 34, 40] and narrow grids [10] for all q G N and on the complete 
graph for q = 2 [40, 41]. One goal of this thesis is to add the Ising model (q = 2) on the 
two-dimensional square lattice to this list. 

There are also some rigorous results regarding slow mixing. We state three of them: Li 
and Sokal [39] proved that the inverse spectral gap is larger than some constant times the 
specific heat (that is proportional to the second derivative with respect to /3 of the Potts 
normalization constant Z, see (2.9)). This shows (at least) that the inverse spectral gap 
cannot be bounded independently from the size of the graph in many cases of interest. 
Furthermore, Gore and Jerrum [28] showed slow mixing of the SW dynamics on the 
complete graph for q > 3 for some value of the inverse temperature. Recently, Borgs, 
Chayes and Tetali [5] gave tight upper and lower bounds on the spectral gap (or mixing 
time) for SW on rectangular subsets of Z d , d > 2, with periodic boundary conditions 
at some (critical) temperature for q large enough, that show that the spectral gap is 
exponentially small in the size of the graph. Since we need some of the above results in 
the rest of this thesis we state them more detailed in Section 2.5. 

Apart from the results given above, the Swendsen-Wang dynamics resisted a detailed 
analysis in (physically) relevant cases, where the heat-bath dynamics is well understood. 
For instance, it was even not proven that SW is rapidly mixing for the Ising model on 
the two-dimensional square lattice for all temperatures above the critical one, which is 
known for the heat-bath dynamics for nearly 20 years [46, 47]. 
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We will prove this statement in Chapter 3 by comparison with the heat-bath dynamics. 
Furthermore we obtain rapid mixing at all temperatures below the critical one. 

The main challenge in proving rapid mixing of the Swendsen-Wang dynamics for the 
two-dimensional square lattice is the low temperature (large (5) regime, where no Markov 
chain is proven so far to be rapid down to the critical temperature. Therefore, we need 
to consider dynamics for the corresponding random-cluster model that seems to be (and 
we will see that it indeed is) more convenient for relating spectral gaps at high and low 
temperatures. 

2.4.2 Dynamics for the random-cluster model 

We begin this subsection with the definition of a non-local Markov chain for the random- 
cluster model. This is again the Swendsen-Wang dynamics. Since it is defined by means 
of conditional probabilities with respect to the FKES measure v (2.21), this Markov 
chain can be naturally defined also for the random-cluster model. 

The Swendsen-Wang dynamics for the random-cluster model is based on the same 
connection that was given in the last subsection, but here we make the two-step proce- 
dure in reverse order: 

SW1) Given a random-cluster configuration A £ fi^c on G, assign a random color 
from [q] independently to each connected component of (V,A). Vertices of 
the same component get the same color. This gives a £ Qp. 

SW2) Take E(a) and delete each edge independently with probability 1 — p. This 
gives the new RC configuration B £ f2 R( > 

Thus, the transition matrix of the Swendsen-Wang dynamics for the random-cluster 
model on G with parameters p and q is given by 

P SW (A,B) :=P S G W , M (A5) = J2 "(v\A)v(B\a) 

(2.22) 

(l-p) lE((7)l t{AUB C E(a)). 

Ergodicity and reversibility of this Markov chain can be shown by similar ideas as 
above. Although the Swendsen-Wang dynamics for the RC model appears quite often in 
the literature (especially its generalization to non-integer values of q, see e.g. [8, 9, 13, 14]) 
we are not aware of any attempt to proving mixing properties of this Markov chain. The 
following lemma shows however that every result on the spectral gap of the SW dynamics 
for the Potts model is also valid for SW for the corresponding random-cluster model. 
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Lemma 2.6. Let Psw (resp. Psw) be the Swendsen-Wang dynamics for the random- 
cluster (resp. q-state Potts) model with parameters p and q (resp. at inverse temperature 
(3 with p = 1 — e~^ ). Then 

A (Psw) — A (Psw)- 
Proof. Define the Qp x fi^c-matrix C by 

C(a, A) := u(A | a) = o G Qp, A G Q RC . 

Regard C as an operator that maps from L 2 (/-0 to L^it), cf. (2.4), by 

Cg{a) = C (^ A )9(A) 

Aen RC 

for g G R^ 110 . By the usual definition of the adjoint of an operator between different 
Hilbert spaces, see e.g. [35, Def. 3.9-1], the adjoint operator C* of C satisfies 

(C*f,g), = (f,CgU 

for all / G M Qp and g G M nRC . Thus, the matrix corresponding to C* is given by 

The definitions of the Swendsen-Wang dynamics of (2.21) and (2.22) imply Psw = CC* 
and Psw = C*C. Additionally, we define S n (a,r) := 7r(r), S^A.B) := n(B) and 
Sfan) (a, A) = fi(A) for all a, r G Qp and A, B G ^rc- It is easy to verify that 
S* M (A,a) = tt(o-), a G fi P , A G fi RC . Thus, S^S^ = S v and S^S^ = S„. We 
obtain that CC* — SV = (C — S( n ^)(C — S^^))* since by definition S^^C* = SV = S* = 
CSf n x, see (2.15) and below. Analogously we obtain C*C — >S M = (C — S , ( 7I - iM ))*(C — S( nifJ ,)). 
Recall the definition of the spectral gap from (2.8). It follows from simple properties of 
the norm of (adjoint) operators between Hilbert spaces, see e.g. [35, Thm. 3.9-4], that 



1 — A(Psw) — 1 1 Psw — SVIL — ||(C — S^^))(C — 5 , ( 7ri/i ))*|| jr 

= ||Psw — = 1 — A (Psw)j 
which proves the statement. 

□ 
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The single-bond dynamics The last kind of Markov chains we want to consider are 
the local Markov chains for the random-cluster model. In fact, we need more than one 
construction in this case. This comes from the fact that the two dynamics, namely heat- 
bath and single-bond dynamics, both have properties that are needed for the further 
analysis. 

First of all we introduce the heat-bath (HB) dynamics for the random-cluster model. 
Similarly to the heat-bath dynamics for the Potts model, suppose that A G Q^c is the 
current state of the Markov chain. Then: 

HB1) Choose an edge e G E uniformly at random. 

HB2) Choose the next state of the Markov chain, say B G ^rc, with respect to 
| {A U e, A \ e}), i.e. the conditional probability (w.r.t. pi) given that B 
differs from A only by e. 

We use A U e instead of A U {e} (respectively for D, \) throughout this thesis. Recall 
that, for A G SIrc, we write u-^v if u, v G V are connected in the subgraph (V,A) 
and that we denote the endpoints of e G E by and e^ 2 \ Write u & v if u and v 
are not connected in (V, A). If we additionally define A e := A e with the symmetric 
difference 0, i.e. 

A e = { T ' (2.23) 

[A\e, if e G A, 

we can write the transition probability matrix Phb := -Phb P9 °f the heat-bath dynamics 
for the RC model as 

P M A,B ) = ^^0L_nB\^A\ e) . (2.24) 

Note that unless A = B at most one summand in the above sum is not zero. Using the 
definition of fi from (2.11) we obtain 



H(A) / ^ 

A\e 



q c{A)-c(A°) : (2.25) 



where c(A) = c(A e ) if and only if e (1) <— ^ e (2) . Otherwise, c(A) -c(A e ) = 2 l(e ^ A) -I. 
Hence, Phb satisfies 



P, 



Pub(AB) = t^E 



if B = AUe and e« A e^ 2 \ 
l-p, if 5 = A\e and e« A e<- 2 \ 
if P = AUe and e« 4^ e^, 



v 



(2.26) 



P+?(1-P) ' 

jfc£L if b = A \ e and e« <4H e ( 2 \ 

V p+q(l-p) ' \ 
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for A,B G ^rc- We immediately obtain from this equation that Phb does not satisfy 
the finite range interaction property, since we have to check connectivity of two vertices 
in each step, which is in general a non-local property. In addition to the need of this 
Markov chain in an intermediate step of our proof of rapid mixing for the Swendsen- 
Wang dynamics, the heat-bath dynamics is interesting in its own right. One reason 
is that it provides also a Markov chain for the random-cluster model with non-integer 
values of q > 0. It would be interesting to prove a similar relation between temporal 
mixing properties of Phb and some "spatial mixing" property in the RC model, as it is 
known for single-spin dynamics for the Potts model. Another reason is that the heat- 
bath dynamics was proposed by Propp and Wilson [56] to produce exact samples from \i 
(and so also from the Potts model) using their famous coupling from the past procedure. 
Therefore, every mixing time bound on Phb leads to a bound on the (expected) cost of 
their procedure. 

To the best of our knowledge, there is presently only one result on mixing of local 
dynamics for the RC model. This is the upper bound on the mixing time of Ge and 
Stefankovic [25] that shows rapid mixing of a Metropolis-type version of Phb for every 
p G (0, 1) and q > if the underlying graph has bounded tree-width. We will state this 
result later, see Theorem 2.19. 

The advantage of the heat-bath dynamics for our purposes is that its spectral gap, if 
the underlying graph G is planar, can be shown to be the same as the spectral gap of 
the heat-bath dynamics for the RC model on the dual graph of G with suitable values 
of the parameters p and q, see Chapter 5. This provides us with a relation of the 
mixing properties of the Swendsen-Wang dynamics at high and low temperatures, see 
Theorem 5.6. 

The second local Markov chain that we consider is inspired by the local behavior of 
the Swendsen-Wang dynamics and we will see (Theorem 4.8) that it enables us to give 
lower and upper bounds on the spectral gap of the SW dynamics in terms of the spectral 
gap of this Markov chain. The aforementioned local behavior can be demonstrated by 
the following example. Fix p G (0, 1), q G N and let the graph be given by G\ = 
({u, v }, {{w, v }}) , i.e. the graph with two vertices and one edge between them. Let 
e = {u, v}, then it is easy to verify that the transition probabilities of the SW dynamics 
satisfy P g ^(0, e) = p/q and P|^(e, 0) = 1 — p. For this note that, following the steps 
(SW1) and (SW2), if the current configuration is C {e} we assign independently a 
random color from [q] to each of the vertices and, if their color agree (which happens with 
probability 1/q), put an edge between them with probability p. Hence, P^(0, e) = p/q. 
The second equality can be shown analogously. We use these transition probabilities 
to construct a local Markov chain for the RC model on arbitrary graphs, which we call 
the single-bond dynamics, as follows. Let the current state be A C E, choose an edge 
e G E uniformly at random and decide if e shall be included in the new configuration 
or not depending on the connectivity of the endvertices of e in A. If the endvertices 
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are not connected in A include e with probability P s ^(0, e), otherwise include e with 
probability P s ^(e, e). Note that in the heat-bath dynamics, we check connectivity in 
A \ e instead of connectivity in A. 

The transition matrix of the single-bond (SB) dynamics is given by 



1 1 eS-E 



Ergodicity of Psb is clear. For the reversibility with respect to fi it is enough to prove 

P SB (A\e,AUe) = u.{A U e) (2.25) f^, if e« Ae( 2 \ 
? S B(AUe,i\e)""/i(i\e) " |_^_ ) ife (D^ e (2) ) 

for every A e f2^c and e <E E. First note that and e^ 2 -* are always connected in AUe, 
hence we divide by Psb (A U e, A\e) = 1 — p independently from A and e in the above 
equation. Additionally, the enumerator in the above fraction is p or p/q depending on 
connectivity in A \ e, as desired. 

As for the heat-bath dynamics for the RC model, the single-bond dynamics is also a 
valid local Markov chain for non-integer values of q, but here we have to assume q > 1 
in order to ensure that Psb is well-defined. 

In Section 4.1 we present the usefulness of this dynamics by providing a common 
representation of this and the Swendsen-Wang dynamics on the joint (FKES) model, 
using the same "building blocks" . We finish this section with an inequality between the 
spectral gaps of Psb and Phb- 

Lemma 2.7. For Phb and Psb for the random- cluster model with parameters p and q 
we have 

(l-p(l-^jj A(Phb) < A(Psb) < A(Phb). 



P, 

1 -p, 

p 

1-2 



if B = A U e and e (1) 4 



■ ,(2) 



if B = A \ e and e« <- 
if B = A U e and e« 



e 



(2) 



if B = A \ e and e« < 



A 



(2.27) 



-><-»■ e 



(2) 



Proof. First we show that Phb and Psb have only non-negative eigenvalues. For this 
write Phb = ^ Zl ee i?-Pe with 

see (2.24). Obviously, P e , e e P, is reversible with respect to // and satisfies P 2 = P e , 
since the distributions P e (A \ e, •) and P e (A U e, ■) are equal. This shows that all P e , 
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e E E, are projections, see [35, Thm. 9.5-1], and thus positive, i.e. (P e g,g)^ > for all 
g G L 2 (fi), see [35, Thm. 9.5-2]. Using that the sum of positive operators is positive we 
obtain positivity ofPn B . It follows that Phb has only non- negative eigenvalues, see [33, 
Obs. 7.1.4]. Similar arguments lead to the same statement forPs B , see Remark 4.4. By 
Lemma 2.5, and since Phb andPsB have the same stationary distribution, it is sufficient 
to show 

l-p(l--))p nB (A,B) < P SB (A,B) < P HB (A,B) 



ioiA^Be Q RC , i.e. for B = A e for some e G E (otherwise P BB {A, B) = P SB (A B) = 0). 
By reversibility we obtain 



P HB (A\e,AUe) P HB (^Ue,A\e) (2.27) \E\ 



P SB {A\e,AUe) P SB (AUe,A\e) l-p 



P HB (AUe,A\e). 



Since g<P H B(AUe,A\e)<^ for q > 1 it follows 



1< Phb{A 1 A^ < q _ 1 (2 28) 

~ P SB (A^ e ) " P + ?(l -P) l-pil-q- 1 ) 

for all A G Qrc and e G E. This finishes the proof. □ 



2.5 Known results 

In this section we present a selection of known results on the mixing properties for the 
above introduced algorithms. In fact, for the heat-bath dynamics for the Potts model, 
we will state only results that are needed for the further analysis. For the Swendsen- 
Wang dynamics and the local Markov chains for the random-cluster model we try to 
give a complete overview of the known results. 

Since, in the original papers, some results are given in terms of spectral gap and some 
by mixing times, we first state a corollary to Lemma 2.3 that we need for translation. 

Corollary 2.8. Let P (resp. P) be the transition matrix of a reversible, ergodic Markov 
chain for the q-state Potts (resp. random- cluster) model on a graph G = (V, E) at inverse 
temperature (3 (resp. with parameters p and q). Then 

A(P)- 1 -! < t mix (P) < (2 + /3 1^1 + 1^1 log?) A(P)- 1 

and 

A(P)- 1 -! < t mix (P) < (2+|£|log 1 +\V\\ogq^j \{P)-\ 
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In particular, this shows that every result of this thesis can also be written in terms 
of the mixing time, loosing the same factor as in Corollary 2.8. 

We begin with the probably most studied instance: the heat-bath dynamics for the 
g-state Potts model on the two-dimensional square lattice. For d > 1, define the d- 
dimensional hypercubic lattice Z d L of side length L as the graph Z d L = (Vz, )£ j, E^d) with 
vertex set Vh,d = {!,-•• , L} d C Z d and edge set Ei,d = {{u,v} C Vi,d '■ \u — v\ = lj, 
where | • | denotes the Euclidean norm. For d = 2 we call Z| the two-dimensional square 
lattice of side length L. In this case (d = 2) there is an almost complete characteriza- 
tion of the spectral gap, that was established over the last decades, especially for the 
Ising model (q = 2). Beginning with the work of Holley [32], Aizenman and Holley [1] 
and Stroock and Zegarlihski [64], who showed rapid mixing of the heat-bath dynamics 
given some spatial mixing property, which is called complete analyticity (or Dobrushin- 
Shlosman mixing condition), see [18], it was finally proven by Martinelli and Olivieri 
[46, 47] that the heat-bath dynamics for the Ising model is rapidly mixing up to the 
critical temperature, i.e. if the inverse temperature (3 satisfies < (3 C (2) = log(l + y/2). 
Using results of Cesi, Chayes, Chayes, Guadagni, Martinelli and Schonmann [7, 6, 62] 
it is known that this is (almost) best possible in the sense that the spectral gap of the 
HB dynamics on 1? L at inverse temperature (3 is smaller than exp(— cL), for some c > 0, 
if /3 > /3 C (2). Only recently rapid mixing at the critical temperature /3 C was proven by 
Lubetzky and Sly [42] . For the proof of rapid mixing of the heat-bath dynamics for the 
Potts model at all (3 < log(l + ^/q) we need exponential decay of connectivities in the RC 
model, see Beffara and Duminil-Copin [3]. This implies weak mixing in the Potts model, 
see Alexander [2], and thus, rapid mixing of the heat-bath dynamics, see Martinelli, 
Olivieri and Schonmann [48]. 

Before we summarize these results in Theorem 2.10 we introduce a variant of the Potts 
measure (2.9) and the heat-bath dynamics (2.18) with (constant) boundary condition. 
For this define the boundary of Vh,d by 

dV L)d := (v = (v u ...,v d )e V L)d : v { G {1, L) for some i G [d]\ 
and let d1[ d (v) be the number of neighbors of v in Z d \ Vi,d, i-e., 

4» : = \{ueZ d \V L4 : \v-u\ = l}\. 
The Potts measure on Z^ with 1-boundary condition is defined by 

*S'V) Z-^Tfr) II exp(/3 4»l(a(^) = l)), aeQ P (Z d L ), (2.29) 

vedv L , d 

— x d 
where Z is the proper normalization constant and 7ip L is defined as in (2.9). This 

measure can be interpreted as the conditional distribution of the configurations on Vi^d 

given that all vertices of Z d \ have color 1. 
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Remark 2.9. The critical inverse temperature (3 c (d, q) for the q-state Potts model on 
Z d , that was cited above for d = 2, is generally defined by 

(3 c (d,q) : = inf{/3: M d ,M > 0}, (2.30) 

where 

M *»<« : = WA £ - <*0 = !>) - ;) • 

(72 is well-known that these limits exist, see e.g. Grimmett [29].) We write f3 c (q) for 
c (2,q). A closed formula for (3 c (q) was first established by Onsager [54] in the case q = 2 
by giving an explicit formula for M^^iP)- A proof of the equality fi c (q) = log(l + yfq) 
for all q >2, which was expected to be true, has been given only recently by Beffara and 
Duminil-Copin [3]. For d > 3 it is still a challenging open problem to give an explicit for- 
mula for the critical inverse temperature. However, it is known (see Laanait et al. [36]) 
that 



P c (d,q) = -logq + Oiq' 1 ^) 



for q large enough. 



7? 1 

Let % := Tia L ' . Similarly to (2.18) we define the transition matrix of the heat-bath 



dynamics for the Potts model on T? L with 1-boundary condition by 

Phb>,t) : = P|| 1A? (a,r) = -L, £ ^ l(rg^) (2.31) 

for all a, r G fip(Z|). We summarize the rapid mixing results stated above in the 
following theorem. 

Theorem 2.10. Let Phb be the transition matrix of the heat-bath dynamics for the 
q-state Potts model on Z| at inverse temperature (3. Let n = L? = (V^l- Then there 
exist constants cp = cp(q) > and C < oo such that 

A(Phb) > C± forf3<(3 c (q) 

A U 

ana 

A(Phb) > n~ c forq = 2 and [3 = &(2), 
where (3 c (q) = log(l + yfq). These bounds hold also if we replace Phb by Phb,i- 

Proof. The results, as originally given in [42, 46], refer to a continuous-time Markov 
process for the Potts model. See e.g. [44] for an introduction to the "Graphical con- 
struction" of the continuous-time heat-bath dynamics for the Ising model. In fact, they 
present lower bounds on gap(Z^) which is defined by 



™™ : = A, 2 £ £ £ ^ m 



ir(a v ' 1 ) 

Var„(/): ' 1 
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cf. [48, Sec. 3]. By the variational characterization of the eigenvalues of reversible tran- 
sition matrices (see e.g. [16] or the proof of Lemma 2.5) we can write 

1 - 6 = inf E <°) P H B (a, r) (/(r) - /(a)) 2 , 

Vaiv(/)=1 CTen P ref7 P 

where £2 is the second largest eigenvalue of Phb- By definition, gap(Z|) = n{\ — £2), see 
(2.18). But Phb has in general only non-negative eigenvalues and thus, A(Phb) = 1 _ £2- 
To see this, write Phb = r^r Suev with P v (cr, r) = 7r(r \ Q a>v ), cf. (2.17), and note 
that P 2 = P v since VL a ^ v = VL TyV for all cr, r with P v (a, r) > 0. That is, P v is a projection, 
see [35, Thm. 9.5-1]. It follows that all P v , v G V, and thus Phb, have only non-negative 
eigenvalues by [35, 9.5-2]. Similar ideas were used in the proof of Lemma 2.7. 

Hence, it is enough to show that there exist constants cjg > and C < 00 such that 

• gap(Z|) > c,3 for < c (q), 

• gap(Z|) > n- 5 for g = 2 and /3 = (3 C {2). 

The second inequality is proven in [42, Thm. 4.2]. By [48, Thm. 3.2] the first inequal- 
ity is equivalent to a weak mixing property of the Potts measure (see [48, eq. (1.11)]). 
This weak mixing property is shown [2, Thm. 3.6] to hold whenever correlations de- 
cay exponentially or, equivalently, we have exponential decay of connectivities in the 
corresponding infinite- volume random-cluster model, i.e., for all u,v G Z 2 we have 

lim i4l({A C E L2 : uAv}) < a^- 02 ^ 

with some < ai(p, q), ^(p, l) < 00 and Euclidean norm | ■ |. This was proven by 
Beffara and Duminil-Copin [3, Thm. 2] for all q > 1 and p < p c (q) := 1 — e~ /3c( -' ? - ) . Fur- 
thermore, the statements of this theorem hold true if we consider the case of 1-boundary 
condition. For this consider [42, Thm. 1] and note that the result [48, Thm. 3.2] holds 
for arbitrary boundary conditions. This proves the theorem. 

□ 

Remark 2.11. As stated above it is known that the heat-bath dynamics for the Ising 
model (without boundary conditions) is slow mixing if (3 > /3 C (2). Additionally, we are 
only aware of a result that shows an exponentially small upper bound on the spectral 
gap if (3 > (3 c (q) and q is large enough for some specific (periodic) boundary condition, 
see [5, Thm. 1.2] or Theorem 2.12 below. However, it is reasonable to believe that the 
HB dynamics is slowly mixing for all q > 2 and (3 > fi c (q) on Z| without (or periodic) 
boundary condition. 
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We now turn to another class of underlying graphs, namely to rectangular subsets of 
the hypercubic lattice Z d . In fact, we consider only the case of periodic boundary condi- 
tion. For this consider the cycle Cl of length L, that is the graph Cl = ({1, • • • , L}, E L ) 
with El '■= {{v,v + 1} : v G {I, . . . , L — 1}} U {1, L} and define, for two graphs 
G\ = (Vi,Ei) and G 2 = (V^-E^), the graph product of G\ and G2, written G\ x G 2 , as 
the graph with vertex set V\ x V2 and (ui, U2), (fi, v 2 ) G V\ X V2 are neighbors in G\ x G 2 
iff either u\ and v\ are neighbors in Gi and w 2 = t> 2 or w 2 and t> 2 are neighbors in G 2 
and Mi = V\. We then define the d-dimensional torus /L d L of side length L by the d-fold 
graph product 

Z d L := C d L = C L x---x C L . (2.32) 
From Borgs, Chayes and Tetali [5] we obtain the following theorem. 

Theorem 2.12. Let Phb (resp. Psw) be the transition matrix of the heat-bath (resp. 
Swendsen-Wang) dynamics for the q-state Potts model on % d L , d>2, at inverse temper- 
ature (3. Then there exist constants ki,k 2 < 00 and a constant k 3 > (depending all on 
d, (3 and q) such that, for q and L large enough, 

and 

witt P c (d,q) from (2.30). In /act, £/ie lower bounds hold for all (5, q and L. 

This theorem shows (at least for large q) that the heat-bath dynamics is slowly mixing 
at and below the critical temperature and, additionally, that also the Swendsen-Wang 
dynamics has an exponentially small spectral gap at the critical temperature if q is large 
enough. We will see in Chapter 4 that an analogous result holds for the single-bond 
dynamics for the random-cluster model. 

Before we discuss other results for the Swendsen-Wang dynamics, we state a result 
for the HB dynamics on a more general class of graphs. For this, fix a graph G = (V, E) 
and define the adjacency matrix A G of G by Aq(u,v) := l({w,f} G E), u,v G V. 
Additionally, write \\A G \\ for its (unweighted) operator norm, i.e. \\A G \\ = max^^x |v4 G x|, 
where the maximum is taken over x G K' y ' and | • | is the Euclidean norm. In the 
literature \\Ac\\ is called the principle eigenvalue of the graph G. The following theorem 
is based on a result of Hayes [31]. 

Theorem 2.13. The heat-bath dynamics for the q-state Potts model at inverse temper- 
ature (3 on a graph G with n vertices satisfies 

A(Phb) > — - 

n 

whenever (3 < 2e \\AqW~ 1 ■ 
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Proof. Note first that one key ingredient for the result is Observation 11 of [31]. This 
result is stated only for the Ising model (q — 2), but it can be generalized quite easily 
by induction on q. We state the proof here for completeness. For this, we have to show, 



for all u, v G V, that 



Pu,v < tanh 



A G (u,v), 



where p U)V , i.e. the influence of v on u, is defined by 



1 



Pu; 



max 
o-eHp 2 



max 
ue!2 P 2 

T&tt a „ k=l 



k= 
1 Q 

IE 



u,k\ 



7T T 



u,k\ 



7r r 



u,l) 



o 0d Utk (cr) 



E 



i=i 



ELi' 



see [31, Def. 4]. As before, Q aM is the set of configurations that differ from a only at v , 
cf. (HB2) on page 19, and d u ^{o~) is the number of neighbors of u in G with color k in 
a. Note that in [31] the above bound is stated with (3 in place of /3/2. This comes from 
the difference in the normalization of the measure. 

Obviously, p U)V = if {u, v} ^ E since the term inside the absolute value depends only 
on the colors of the neighbors of u, which are equal if v is neither of them. Now fix some 
neighbors u,v G V and configurations a, r G Qp with r G ft a ,v Let r k := d u ^(o~). Since 
a and r differ only at one neighbor of u, there exist i,j G [g] such that d u ^{r) = + 1, 
du,j( T ) — r j ~ 1 an d d U)k (r) = r k for all 7^ i, j. Assume w.l.o.g. that z = 1 and j = 2. 
Then some simple calculations show that 



1 



fe=i 



?Pd Uyk (r) 



E 



El! 



,/3d u ^(r) 



N q (r) 
D q (r) 



(n, • • .,r ff ), 



where 



and 



iV ? (r) 



/3-j e /3(ri+r 2 ) 



+ max< (e 



l)e 



/3ri 



D g (r) 




1) + e Kr2- 



~ 1)+ E 



2=3 



fc=3 



We will prove that N q {r)/D q {r) < tanh(/3/2) for all reR 5 and g e N by induction. Let 
us first recall the q = 2 case from Observation 11 of [31]. In this case the last sum in 
the definition of N 2 and D 2 disappears. Thus, for r£l 2 , 

N 2 (r) 



- e" /3 )e /3(r ' 1+r2) 



-e-V 



< 



e? - e-P 
el 3 + C-P + 2 
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where the last inequality comes from e x ' + e x > 2, x G R. Now assume that the 
statement holds for q — 1 and all s = (r 1; . . . , r g _i) G R 9_1 and let r = (r 1; . . . , r g _ 1; r g ) 



for some r q G R. We obtain 

N q (r) _ iV f ,_ 1 (s) + e /3r 'max{(e /3 -l)e /3ri ,(l-e- /3 )e /3r2 } 

I> g (r) ~ £) g _i(s) + (e/ 3 + 1) ^+^> + (1 + e^) ^+^> + e 2 ^ + 2e" r * ELs^ 
iV„_i(s) + e" p « max{(e" - l)e /3ri , (1 - e - /3 )e /3r2 | //3 
" L> 9 _ 1 (s) + e^[(e^ + l)e^ + (l + e-^)e^] " V 2 

since a/6 < t and c/d < t imply (a + c)/(b + d) < t for a,b,c,d,t > 0. This proves 
Observation 11 of [31] for all g G N and thus, under the assumptions of this theorem, 
we have by [31, Thm. 6] that 

max IIPurCct, •) — 7r|L,. < nil 



tren P (G) " HBV "' 1 "Htv - v 
Using Lemma 2.2 we obtain the result. 

□ 

In particular we obtain the following corollary, see [31]. 

Corollary 2.14. The heat-bath dynamics for the q-state Potts model at inverse temper- 
ature (3 on a graph G with n vertices and maximum degree A satisfies 

A(Phb) > ' ~ : 



n 

ifP < f ■ 

Proof. Using [33, Thm. 8.1.22] we obtain \\Aa\\ < max u6 y J2 v &v Ag(u, v ) = A(G). 
Thus, the result follows from Theorem 2.13. □ 

In [31] one can find also an improvement of this corollary if we restrict to the class of 
planar graphs. We will use this in Chapter 5 to obtain a result for the Swendsen-Wang 
dynamics on planar graphs, see Corollary 5.7. 

Remark 2.15. There are, of course, a lot of other results concerning mixing properties 
of the heat-bath dynamics for the Potts model. These include, e.g., rapid mixing for heat- 
bath dynamics on trees at all temperatures [4J. Additionally, only recently the complete 
picture of rapid mixing or not was established for the HB dynamics on the complete 
graph [12, 38]. 

Now we want to state some known results on the mixing properties of the Swendsen- 
Wang dynamics. We try to give an overview of all known results, but we omit results 
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that involve boundary conditions (or external magnetic field), see e.g. [43, 49, 50], and 
results where the underlying graph is random, see [11]. 

We start with the known results on the complete graph, where at least for q = 2 
the mixing behavior is completely known. Denote by K n the complete graph on n 
vertices, i.e. K n = ([n], (^)), where is the set of all two-element subsets of [n]. 
Note that, for two non-negative functions /, g : N — > R, we write f(n) = Q(g(n)) iff 
< lim^oo < oo. The result below is due to Long, Nachmias and Peres [41, 40] in 
the case q = 2, see also [10]. For q > 3 the results are adopted from Gore and Jerrum [28] 
and Huber [34]. 

Theorem 2.16. Let P^w be the transition matrix of the Swendsen-Wang dynamics for 
the q-state Potts model on the complete graph K n at inverse temperature (3 = log (-3-), 
c > (or p = 1 - e _/3 = ~). Then, with t mix := t mix (Psw), 

(i) for q = 2 and 

• c< 2: t mix = 0(1). 

• c = 2; t mix = 6(^/4). 

• c> 2; t mix = e(log(n)). 

(ii) for q > 3 and 

• 2(g-l)log(q-l) 

q-2 

• C <\ : 

• c> 2glog(3gn): 

As far as we know, Theorem 2.12 and Theorem 2.16(h) contain the only presently 
known slow mixing results for the Swendsen-Wang dynamics (except for results on ran- 
dom graphs) . Other classes of graphs where rapid mixing of SW is known at all tem- 
peratures, but now for every gGN, are trees and cycles, see Cooper and Frieze [11] and 
Long [40]. 

Theorem 2.17. Let T be a tree on n vertices and C n be a cycle of length n. Then 

• t m ix{Psw,p,q) = Q(log(n)) and 

• Uix( J P s < w,/3,« ? ) ^ cnlog(n) for some c < 00. 

In fact, we have A(P s 1 Wi/3 g ) = 1 - p(l - ±). 

For the statement on the spectral gap consider the construction of [40, Chap. 7] and 
Exercise 12.7 of [37]. The last result that we want to present here for the SW dynamics 



^mix < Clog(n), 
^mix — Cqn, 



for some e > 0. 
for some C < 00. 

for some C < 00. 
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is, similar to Corollary 2.14, a result on graphs of bounded maximum degree from [34], 
see also [11]. 

Theorem 2.18. The Sw ends en- Wang dynamics for the q-state Potts model at inverse 
temperature (3 on a graph G with n vertices and maximum degree A satisfies 

^mix(-Psw) < C\og(n), for some C < oo. 

whenever (3 < 3 ( A 1 _ 1 ) . 

Now we turn to a result for the single-bond (or heat-bath) dynamics for the random- 
cluster model. In fact, this is the only result on the mixing properties of this Markov 
chain that we are aware of. 

For this define the linear-width of a graph G = (V, E) as the smallest number I such 
that there exists an ordering ex, ... , e\E\ of the edges with the property that for every 
% E [\E\] there are at most £ vertices that are an endvertex of both, an edge in {ei, . . . e^} 
and an edge in {e i+ i, . . . e\E\}- See [25] for bounds on the linear- width of paths, cycles, 
trees and a bound in terms of a related quantity, the tree-width. 

The following result is due to Ge and Stefankovic [25] (we state only the q > 1 case). 

Theorem 2.19. Let Psb be the transition matrix of the single-bond dynamics for the 
random- cluster model with parameters p and q > 1 on a graph G = (V, E) with linear- 
width bounded by £. Let m := \E\. Then 

A(Psb) > — \ ( 2 - 33 ) 

Proof. In [25] the authors consider the (lazy) Metropolis version of the single-bond 
dynamics. This Markov chain has transition probabilities 

Pu(A, A*) = ^mm 1, q< A °)-^ [JL- J I , A C E, 

withPM(^4, A) such that Pm is a stochastic matrix and A e from (2.23). For this Markov 
chain they proof a lower bound on the congestion, which is defined as follows. Let 
T := {iab '■ A, B C E}, where jab are paths from A to B in the graph H = (^rc, £) 
with £ = {(v4,I?) : Pyi(A,B) > 0}. Then we define the congestion of Pm (w.r.t. T) by 

g(P M ,r) := max , , ~ Yl \lAc\ v(A) u.(C), 

(B 1 ,B 2 )ee f x(B 1 )P M (B 1 ,B 2 ) ac . (Bi B2) ^ ac 

where \jac\ denotes the length of the path. The bound of [25, Lemma 16] is q(Pm, T) < 
2 \E\ 2 q e for a suitable choice of T and so we obtain by [16, Prop. 1'] (note that Pm is 
lazy) that 

A^m)- 1 < 2|P|V. 
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But since it is easy to show that Pu{A,B) < qPo,^{A, B) for all A ^ B C E and that 
Psb has only non-negative eigenvalues, see Remark 4.4, we can conclude with Lemma 2.5 
that 

A(Psb)" 1 < qX(Pu)- 1 < 2\E\ 2 q e+1 . 

□ 
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This chapter is based on [66]. We prove by comparison that the spectral gap of the 
Swendsen-Wang dynamics (SW) is bounded from below by some constant times the 
spectral gap of the heat-bath chain (HB). This result leads to rapid mixing of SW on 
graphs of bounded degree whenever HB mixes rapidly. 

We will prove the following theorem, which is a minor improvement of [66, Thm. 1]. 

Theorem 3.1. Suppose that Psw (resp. Pbb) is the transition matrix of the Swendsen- 
Wang (resp. heat-bath) dynamics for the q-state Potts model at inverse temperature (3 
on a graph G with maximum degree A. Then 

A(Psw) > c sw A(P HB ), 

where 

c sw = c sw (A,/3,g) := q~ l {qe^)' 2A . (3.1) 

Remark 3.2. The inequality of Theorem 3.1 is probably off by a factor of \V(G)\, 
because we compare the SW dynamics with a Markov chain that changes only the color 
of one vertex of the graph per step. We conjecture that a bound of the form A(Pgw) — 
c|V((j)| A(Phb); f or some constant c > 0, holds and that, in particular, this constant c 
has a "better" dependence on the involved parameters. Unfortunately, this does not seem 
to be possible with our techniques. 

Before we prove Theorem 3.1 we state some corollaries that can be deduced directly 
from the known results for the heat-bath dynamics, see Section 2.5. 

The first corollary deals with the class of graphs with bounded maximum degree. It 
relies on a slight generalization of a result of Hayes [31], who gives a simple condition 
on P, depending on the maximum degree, for rapid mixing of the heat-bath dynamics 
for the Ising model, see Theorem 2.13 and Corollary 2.14. 
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Corollary 3.3. The Swendsen-Wang dynamics for the q-state Potts model at inverse 
temperature (3 on a graph G with n vertices and maximum degree A satisfies 

A(Psw) > - 



n 

with c sw = c sw (A,/3,g) from (3.1) and e > 0, if 

This result improves the result of Huber [34] (see Theorem 2.18) in the range of appli- 
cability, which was /3 < 1/ (3(A — 1)) before, but while the result of [34] is a logarithmic 
(in n) upper bound on the mixing time, Corollary 3.3 together with Corollary 2.8 lead 
only to a quadratic bound. We will see an improvement of the above result in Chapter 5 
if we consider only planar graphs, see Corollary 5.7. 

The second corollary, which we call theorem because of its importance for the rest 
of this thesis, gives a bound on the spectral gap for the Swendsen-Wang dynamics on 
the square lattice. For this recall that the two-dimensional square lattice of side length 
L is the graph 7? L = (Vl i2 , E L 2 ) with vertex set V^,2 = {1, ■ ■ ■ , L} 2 C Z 2 and edge set 
El,2 = {{u,v} C Vl,2 '■ \u — v\ = l}, see Figure 2.1. 

Theorem 3.4. Let Psw be the transition matrix of the Swendsen-Wang dynamics for 
the q-state Potts model on Z| at inverse temperature (3. Let n = L 2 = [V^l- Then there 
exist constants cp = cp(q) > and C < oo such that 

A(Psw) > CJ ~ for(3<(3 c (q) 
n 

and 

A(Psw) > c sw n- c forq = 2 and = &(2), 
with c sw = c sw (4,/3 c (2),2) from (3.1) and f3 c (q) = log(l + y/q). 

This result is due to Theorem 3.1 and the corresponding result for the heat-bath 
dynamics, see Theorem 2.10 as well as the references that are given in its proof. 

Remark 3.5. Note that only the presence of q under the exponent in the definition of 
Csw from (3.1) prevents us from the application of Theorem 3.1 to the complete graph K n . 
This comes from A(K n ) = n — 1 and the usual normalization of the inverse temperature 
to (3 = - for some c > 0, cf. Theorem 2.16, which would lead (without the q) to a lower 
bound on c sw (K n , ^,q) independent of n. 
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3.1 Proof of Theorem 3.1 

The proof of Theorem 3.1 is based on standard techniques for the comparison of Markov 
chains, see Lemma 2.5, together with an appropriate choice of an auxiliary Markov 
chain that can be compared to both, Swendsen-Wang and heat-bath dynamics. For the 
remaining section fix a graph G = (V,E), some j3 > and g 6 N, and recall that we 
denote by 7r the measure for the g-state Potts model on G at inverse temperature (3, see 
(2.9). We will analyze the auxiliary Markov chain with transition probability matrix 

Q — -Phb-Psw-Phb (3.2) 

with P H b from (2.18) and Psw from (2.21). Since Phb and Psw are reversible with 
respect to tt, Q is also reversible. 

The first lemma shows that a Markov chain with transition matrix Q has a larger 
spectral gap than the heat-bath dynamics. 

Lemma 3.6. With the definitions from above we get 

A(Q) > A(Phb). 

Proof. With S n (cr, r) = tc(t) for a, r G fip, we have Q — S n = (P H b — SV) -Psw (-Phb — S w ), 
which is self-adjoint. Hence we can write the spectral gap (see (2.8)) as 

l-A(Q) = WQ-S^l = IKPhb-^Psw^hb-^IL 
< 1 1 -Phb — 11-PswlL < 1 1 -Phb — ^11^ 
= I-A(Phb), 

where we use sub-multiplicativity of the spectral norm as well as HPswII^ < 1 and 
1 1 Phb — < 1- 

□ 

To prove a lower bound on the spectral gap of Psw it remains to prove A (Psw) > c\(Q) 
for some c > 0. For this we need an estimate of the transition probabilities of the 
Swendsen-Wang dynamics on G with respect to some subgraph of G. Therefore we 
denote the transition matrix of the Swendsen-Wang dynamics for the g-state Potts model 
on a graph G at inverse temperature f3 throughout this section by Pq, i.e. 

Pg '■= Pswa<?' (3-3) 
cf. (2.21). We prove the following lemma. 
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Lemma 3.7. Let G = (V,E) be a graph and Gq = (V,Eq) be a spanning subgraph of G 
with E C E. Then 

a[ E ^P Go (a,T) < P G (a,r) < af^ 1 P Go (a,r) 
for all a,r G Q~p, where 

ai = ai((3) := e^ 13 

and 

a 2 = a 2 (P,q) := l + g(e"-l). 

Proof. The first inequality is already known from the proof of Lemma 3.3 in [5], but 
we state it here for completeness. Let p — 1 — e _/3 and note that Eo(a) C E(a) for all 
a e fJ P . We obtain by (2.21) that 



Age 

> P lAl ( X - p) lE{a)HAl q~ c(A) t(Ac E (a) n E (r)) 

Ac Bo 

= (l- P r^-^P Go (a,r) > (l- P r\^P Go (a,r). 

For the second inequality suppose for now E = E \ e for some e & E and note that 
c(AUe) > c(A) - 1. We get 

P G (cr, t ) = £ p |A| (1 _ p) l^)|-|A| g - c (A) 
Ac£((r)nE(T) 

Yl P lA{ ~ p) {E{cT)HAl q~ c(A) 

AcE(a)nE(r): 
e£A 

+ J] p \A\ {1 _ p) \E(a) H A\ q -c(A) 

AcE(a)nE(r): 
e£A 



j2 p^(i- P y- - (/ 



J£((j)|-|A'Ue| -c(A'Ue) 

A'CE {a)r\E (r) 

+ £ p |A'| (l _ p) |B( CT )|-|A'| ? -c(A0 

< V p\ A '\ (1 - p)l«»WI-l^l g"^') + P Go (a, r) 

1 — p ^— ' 

= (y^ + l) Poo(^r) = (1 + g(e^-l))P G >,r). 

For |P \ i?o| > 1 one can iterate this technique \E\Eq\ times. This proves the statement. 

□ 
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We use this lemma to prove that the transition probability from o to r is similar to 
the probability of going from a neighbor of a to a neighbor of r. Recall that a v,k is 
defined by a v,k (v ) = k G [q] and a v,k (u) = cr(u), u ^ v. 

Lemma 3.8. Let a, r G Qp, v G V and fc, Z G [g]. T/ien 

P G (<r"V) < a? SGW P G (a,r) 

a 3 = a 3 (/3,g) := g e 2/3 - (q - 1) e^, 
where deg G (v) denotes the degree of the vertex v in G. 

Proof. Define E v := {e G E : u G e} and := (V, £/ \ E v ). Then, v G V is an isolated 
vertex in G v . By the definition of the Swendsen-Wang dynamics we get that 

P Gv {° v '\r v ' 1 ) = P G >,r). 

If we set Eq = E \ E v we get \E \ Eq\ = deg G (t>). We obtain by Lemma 3.7 that 

p („v,k -v,l\ deg G (u) p , v k v,l\ _ dfig G (t)) p / \ 



deg G (») 

< (-1 Po(a,r) 



02 V 

with ai and 02 from Lemma 3.7. 

□ 

Now we are able to prove the main result of this chapter. 

Proof of Theorem 3.1. Because of Lemma 3.6 we only have to prove A(Psw) > c sw HQ)- 
Let 

Psw(ci,n) to a\ 

c := max — — -, (3.4) 

0-1,0-2, Ti,T 2 eflp Ps\N\°~2T r 2) 

Ol~02, Tl~T2 

where a ~ r J^^ey ^-( a ( v ) T ( v )) < 1> i- e - an d r differ in at most one vertex. 
Note that Phb(o"? t) 7^ if and only if a ~ r. We get for <j\,T\ G Qp that 

Q(<Ti,Ti) = Phb{vi,<72) PSW{V2,T 2 ) Phb{t2,Ti) 

02,T2£f2p 

< cP sw (o-i,n) ^W^i,*^) P hb(t 2 ,Ti) 

< gcP S w(o-i,Ti). 
Using Lemma 2.5 we obtain 

HQ) < Qc\(P sw ). 
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It remains to bound c. Recall that deg G (v) < A for all v G V(G). With from 
Lemma 3.8 we get for <7i, Ti, t 2 G flp with o\ ~ o"2 and n ~ r 2 

-Psw(c r l)Tl) . A -Psw(c2,n) , 2A -^SwO^,^) 2A 
"SW(0"2,T 2 ) -Psw(C2,T-2) -Psw(0"2,T"2) 

Finally, 

A(Q) < gcA(Pgw) < gaf A(Pgw) < g (ge 2/3 ) 2A A(P S w)- 
This completes the proof. 

□ 



3.2 A slight generalization 

In this section we present a generalization of Theorem 3.1 that is necessary to handle 
also graphs with a single vertex of maximal degree. The idea behind this modification 
is that the transition probabilities of the Swendsen-Wang dynamics are invariant under 
global flips of the color of all vertices, i.e. 

P;wO, r) = P S wO + k,r + l), a,r G ft P , k, I G [q], (3.5) 

where (a + k)(v) := {cr(v) + k — 1 mod g) + 1 for all v G V. To see this, note that 
P sw (cr, r), see (2.21), does not depend on the precise colors in a and r, but on the 
"edges of agreement" E{a) and E(r), see (2.10), which are invariant under global flips. 
If we now consider the heat-bath dynamics with additional global flips, i.e. in each step 
make one step using Phb and then change the color of all vertices at once by a random 
increment, it is reasonable to conjecture that SW has also an (almost) larger spectral gap 
than this Markov chain. In the following theorem we prove this statement, but state it 
in a different form, namely, we consider the original Markov chains under the condition 
that the color of the configurations is fixed at a single vertex. Because of symmetry 
we can assume that the fixed color equals 1. This can be interpreted as "boundary 
conditions" for the neighbors of this single vertex. 

For some fixed vertex w G V in the graph G = (V, E) we denote by 

A. := {I,---, q} VXW 

the set of all colorings of the vertices V \ w and define a probability measure on A w by 

ir w (a) := qnia 1 ), a G A w , (3.6) 

with 7r from (2.9), where a 1 G f2p is a coloring of V such that o" 1 (n) = cr(u), u ^ w, and 
a l (w) = 1. One may think of ir w as the conditional probability measure on the colorings 
of the vertices V \ w with respect to tt given that w is colored 1. 
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It is not difficult to show (using ideas similar to them of the proof of Theorem 3.9 
below) that the Markov chain on with transition matrix P(er, r) := qPsw(a, r), where 
a, r G f2p and cx, f G A w such that 

0"(v) := (er(-y) — cr(w) mod q) + 1, u G V \ u>, 

is reversible with respect to n w and has the same spectral gap as Psw Additionally, we 
define the heat-bath dynamics that is reversible with respect to tt w similar to (2.18) by 

P&faf) ■= Pgftfa?) = TyT^J £ T * ^L^ MrenU (3.7) 

where v := {r G A w : f(u) = cr(u),Vu ^ v}. The following theorem gives a com- 
parison inequality between P sw and P BB that will be essential to utilize results on the 
heat-bath dynamics for the Potts model with boundary conditions, cf. (2.29), for the 
analysis of the SW dynamics. 

Theorem 3.9. Suppose that Psw is the transition matrix of the Sw ends en- Wang dy- 
namics for the q-state Potts model at inverse temperature (3 on a graph G = (V,E). 
Furthermore, let w G V be any vertex and P BB as in (3.7). Then 

A(Psw) > c sw A(P^ B ) 

with 

c S w = c sw (G,/3,q) := q~ 1 (qe 2fi ) 

where 

A := max deg G (u) 

u£V\w 

is the maximum degree over V \ w. 

Before we prove Theorem 3.9, we present an application of it. For this recall the 
definition of the two-dimensional square lattice Z| = (Vl,2, Px,2) of side length L from 
Section 2.5 and that the boundary of Vt,i is defined by <9Vl,2 := {v = (vi,v 2 ) G Vt,i '■ 
Vi G {1,P} for some i G [2]}. Now we introduce a new auxiliary vertex v* and let Z^j 
be the graph Z|_ 1 with additional vertex v* and edges between v* and all boundary 
vertices u G OVl-1,2- That is Z^J := (V^ 2 , E' L2 , <p) is the (multi-)graph, cf. Section 2.3, 
with vertex set V^ 2 = Vl-i,i Uf* and edge set E\ 2 such that the set of endpoints of the 
edges in E\ 2 satisfies i^{E\ j2 ) = E L _ 12 U {{v*,u} : u G dV^-i^} and, for each of the 
vertices (1, 1), (1, L — 1), (L — 1, 1), (L — 1, L — 1) G V^ 2 , there are two parallel edges to 
v*, see Figure 3.1. Now we can deduce the following directly from Theorem 2.10. 
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Figure 3.1: The graph Z^ 

Corollary 3.10. Let Psw be the transition matrix of the Sw ends en-Wang dynamics for 
the q-state Potts model on Z^ at inverse temperature (3. Let n = \Vl 2 \- Then there 
exists a constant cp = cg(g) > such that 

A(Psw) > - forP<P c (q) 
n 

with P c (q) = log(l + y/q). 

Proof. Let the auxiliary vertex v* be the vertex with fixed color for Theorem 3.9. Then 
we get that P^b from (3.7) equals Phb,i from (2.31). Thus, the result follows from 
Theorem 2.10. 

□ 

We finish this chapter with the proof of Theorem 3.9. 

Proof of Theorem 3. 9. The proof of this theorem is very similar to the proof of Theo- 
rem 3.1. First, recall that for a G ftp, a{v) = (o~(v) — a{w) mod q) + 1, v G V \ w, such 
that a G A w . Additionally, for o G A w , we denote by a 1 G Qp the configuration with 
o" 1 (v) = o"(u), w^w, and ^{w) = 1. 

We define, for o G £lp and f G A w , the "flip" transition matrices 

Fi(<7,f) := l(f = a) 

and 

1 9-1 

F 2 (f,a) := -^l(a = f 1 + /). 

(If we consider Fi as an operator mapping from L 2 (tt w ) to L2(tt), then P 2 = F*.) 
With Ti from (2.9) and tt w from (3.6), it is easy to check that nFi = ix w and k w F 2 = ir. 
Following the same ideas as in Section 3.1 with 

Q = F\ -Phb F*2 Psw -^i Prb F*2 j 
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which is reversible with respect to 7r, we get 



\\Q — SnWn — Irl ^HB ^2 — 'S'ttIItt — || Pi (P^B - ^ w ) ^2 || w < 1 1 Phb ~ ^ 1 1 tt™ • 

This proves X(Q) > A(P BB ). It remains to prove that A(Psw) > c sw A(Q). By the 
construction of the Swendsen-Wang dynamics we know that 



Hence we get with 

P;wOi,t~i) 
c := max — — - 

o-i,o-2,Ti,T2en P Psw( cr 2)T2j 



cf. (3.4), that for all a,r eVL P 



Q(a,r) = P HB(^^i)^(a 1 ,a 2 )P S w(^,r 2 )P^ B (r 2 ,f 1 )P 2 (f 1 ,r) 

CTljflgAu,, 

T2Gf2p 

< cPsw^Sf 1 ) E P hb(^i) E P hb(^t) 



9 

T2:t2~t 



= ?P sw (a,r) E P hb(^3,t) 

T3 
T 

< gcP S w(o",T). 



r3_GA„: 

T 3 ~T 



With 03 from Lemma 3.8 we get for a±, a 2 , t±, t 2 G Op with erf ~ <r 2 and t\ ~ r 2 x that 
PswCgVnj ^ Psw(o r 2,n) , 2 a Psw(o"2,r 2 ) _ 2 a 

-Psw(0"2,T"2j ~~ Psw(C2,T2) _ Psw(0"2,T"2) 

because deg G (f ) < A for all v G V\w. By the same ideas as in the proof of Theorem 3.1 
(and Lemma 2.5) we conclude 

A(Q) < gcA(Pgw) < qaf A(P SW ) < g (ge 2/3 ) 2A A(P SW )- 



This completes the proof. 

□ 
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Comparison with single-bond dynamics 



The goal of this chapter is to present a comparison of Swendsen-Wang and single-bond 
dynamics for the random-cluster model. In fact, we prove that the spectral gaps of 
both Markov chains are equal up to a small polynomial in the number of edges of the 
underlying graph G = (V,E), i.e. up to c|_E|log|.E| for some c < oo. Especially this 
proves that rapid mixing of both Markov chains is equivalent. 

The plan of this chapter is that we first present a representation of both Markov 
chains (or their transition matrices) on the joint (FKES) model, see (2.12), using the 
same "building blocks". Then we provide some technical lemmas that yield estimates 
on the norm of products of operators between (not necessarily equal) Hilbert spaces. 

Using these ingredients we obtain several new rapid mixing results for the single-bond 
dynamics, but we can also use Theorem 2.19 to get a result on Swendsen-Wang dynamics 
on graphs of bounded linear-width. Additionally we adopt the result of Theorem 2.12 to 
prove matching upper and lower bounds for the spectral gap of the single-bond dynamics 
on the d-dimensional torus, see (2.32), at the critical temperature that show slow mixing 
if the number of colors is large enough. 

4.1 Common representation 

We want to represent the Swendsen-Wang and the single-bond dynamics on the joint 
model, which consists of the product state space flj := flp x and the FKES measure. 
This was done first in [68] and we follow the steps from this article. First recall that, if 
G — (V,E),p= 1 — e _/3 and q e N are fixed, we denote by fi (resp. v) the random-cluster 
(resp. FKES) measure on G with parameters p and q, see (2.11) and (2.12), and that 
L 2 (/i) (resp. L 2 (i')) is the Hilbert space that consists of Mp RC (resp. with the inner 
product (•, -) M (resp. (•, •)„). 

We introduce the stochastic matrix M that defines the mapping (by matrix multipli- 
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cation) from the RC to the joint model by 

M(B, (cx, A)) := q- c{B) 1 (A = B) t(B C E(a)) (4.1) 

for B G fiptc and (a, A) 6 Qj. Obviously, we get by definition of \i and v that M satisfies 
u.M(a,A) = J2 BenRc fi(B) M(B , (a, A)) = u(a,A). (See the discussion after (2.15) for 
a possible interpretation of such an equality.) Like in (2.4), M defines an operator that 
maps from L 2 {y) to L 2 (u) by 

Mf(B) := J2 M(B,(a,A))f(a,A), / G L 2 {v). (4.2) 

Its adjoint operator M*, i.e. the operator that satisfies 

(M*g,f) v = (g,Mf)^ for all / G L 2 (u),g G L 2 (/i), 

can be given by the (stochastic) matrix 

M*({a,A),B) = t(A = B). 

One simple and useful property of the matrices M and M* is that their product sat- 
isfies MM* (A, B) — ~1(A = B) for all A,BE ^rc, or equivalently, the corresponding 
operators satisfy MM* = I with If :— f for all / G L 2 (n). 

Remark 4.1. The matrices from above will be used with the following interpretation in 
mind. We want to construct a Markov chain for the RC model that is reversible with 
respect to u.. This could be done, starting in a RC configuration, by making a "step" 
with M first, which lead to a configuration in the joint model (with an unchanged RC 
coordinate). Then make updates of the RC coordinate in the joint configuration that 
are reversible with respect to v. Finally, forget about the coloring to obtain a new RC 
configuration. At first sight this is no advantage, but we will see that it is possible in 
some cases to find a representation of Markov chains for the RC model by rather simple 
update rules on the joint model, while the original updates are difficult to handle. 



The following matrices provide the updates of the "RC coordinate" in the joint model. 
For {a, A), (r, B) G fij and e G E with e = {e^, e^} let 

{p, B = A U e and a(e«) = a(e^) 

1-p, B = A\e and cr(e (1) ) = <r(e (2) ) (4.3) 
1, B = A\e and a(e«) ^ a(e^). 

The following lemma shows some interesting properties of the matrices from (4.1) and 
(4.3). For example {T e } eS £ is a family of commuting projections in L 2 {y). This will be 
important in the proof of the main result. 
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Lemma 4.2. Let M , M* and T e be the matrices from above. Then 

(i) M*M and T e are self-adjoint in L 2 (v). 

(ii) T e T e = T e and T e T e i = T e iT e for all e, e' G E. 

(iii) \\T e \\ v = 1 and \\M*M\\ V = 1. 

Proof. The first part of (i) is obvious since (M*M)* = M*(M*)* = M*M, see e.g. [35, 
Thm. 3.9-4]. The second part can be checked easily using (2.6). Part (ii) comes from the 
fact that the transition probabilities depend only on the "coordinate" that will not be 
changed. For (iii) note that \\M*M\\ U = \\MM*\\^ see [35, Thm. 3.9-4], and MM* = I 
with // := /, / e L 2 (fi). Hence, ||M*M|| I/ = ||/|| = 1. It remains to prove \\T e \\ u = 1. 

By T e T e = T e and the self-adjointness of T e we obtain \\T e \\ v = ||T"e IL ~ ll^e||^- This 
implies \\T e \\ v G {0, 1}, but since T e g — g,if g G L 2 (v) is constant, \\T e \\ v cannot be zero. 



We finish this section with a lemma that demonstrates the relation of M and T e to the 
Swendsen-Wang and single-bond dynamics. Recall that P$w and Psb are their transition 
matrices, see (2.22) and (2.27). 

Lemma 4.3. Let M, M* and T e be the matrices from above. Then 



From Lemma 4.2(H) we know that the order of multiplication in (i) is not important. 
Proof. For (i) note that 



□ 





UT e )((a,A) 



(r,B)) = l(a = T)l(BcE(a))pW(l-p)W\-\ B \. 



Hence, 




aeflp v 

=l(AUBGE(a)) 

Psw(A,B). 



J2 Q' C(A) MA C E(a))t(B C E(a)) p^(l - p)m*)H B \ 
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For part (ii) we define t e (a) := t(a(e^) = a(e^)) and t e (A) := l(e« A e&) for 
a G f2p, A G f2 RC and e G £ with e = {e (1) , e (2) }. Now write 

T e ((cr, A), (a, B)) = t(B = A\e) +pt e (a)\t(B = AUe) -t(B = A\e) 

and note that |{cr G f2 P : Ac £?(cr)}| = g c ^ A ^ (because the colorings have to be constant 
on each of the c(A) components) and 



-c(A) 



E i. 

a:AcE(a) 



{*) = - + MA) ( i - - 

9 V 9 



Hence, 



MT e M* [A, B) = ^2 t(A C E(a)) T e ((cr, A), (a, B)) 

= t (B = A \ e) +p \t (B = A U e) - 1 (B = A \ e) 



-c(A) 



E t 

a:AcE(a) 



p, B = Alle and e (1) e (2) 
1 - p, B = A \ e and e« A 
2 , B = iUe and e (1) A e (2) 
/i ■ A \ ( and f ' ' ' ' 



1-2, B = A\e and e« A e< 2 ). 
Taking the sum over E and dividing by \E\ show equality to (2.27). 



□ 



Remark 4.4. We know from Lemma 4-2 that T e , e G E, is self-adjoint and idempotent. 
Hence, T e is positive, i.e. (T e f,f) u > for all f G L-iiy), see [35, Thm. 9.5-1 & 9.5-2]. 
In fact, all eigenvalues ofT e are either and 1. Additionally, Y\ e&E T e is positive, because 
it is the product of commuting and self-adjoint linear operators, see [35, Thm. 9.3-1]. It 
follows, 

(Psw<7,<7>„ = (^M\JlT}jM*g,gJ = (Jj\T^ M*g,M*gJ > 



and 



(Psb<?,<?) m = T^E (MT e M*g,g)^ = ]T (T e M*g,M*g) v > 



eeE 1 1 eG-E 

/or all g G L 2 (fi). This shows that the transition matrices of Swendsen-Wang and single- 
bond dynamics are positive, which yield that they have only non-negative eigenvalues, see 
[33, Obs. 7.1.4]. 
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4.2 Technical lemmas 



In this section we provide some technical lemmas that will be necessary for the analysis. 
We state them in a general form, because we guess that they could be useful also in 
other settings. First let us introduce the notation. Consider two Hilbert spaces Hi and 
H 2 with the corresponding inner products {-,-)hi and (-,-)h 2 - The norms in Hi and H 2 
are defined as usual as the square root of the inner product of a function with itself. 
Throughout this section we consider two bounded, linear operators, R : H 2 — > Hi and 
T : H 2 H 2 , such that 

• T is self-adjoint, i.e. T = T*, and 

• T is positive, i.e. (Tg,g)n 2 > for all g G H 2 , 

We denote by || • (resp. || • \\h 2 -^Hi) the operator norms of operators mapping from 
Hi to Hi (resp. H 2 to Hi), i.e. 

ll-R||tf 2 ->ffi := max ll-fttflltfi 

IMIir 2 =1 

and || • \\h 1 as in (2.7). It is well-known that the adjoint operator of R, i.e. R* : H\ — > H 2 
with (R*f,g) H2 = (f,Rg) Hl for all / G Hi and g G H 2 , satisfies \\R*\\ Hi ^h 2 = \\R\\h 2 -+Hi, 
see e.g. Kreyszig [35, Thm. 3.9-2]. Additionally note that self-adjointness of T implies 
that the operators RT k R*, fceN, are self-adjoint operators on Hi. 



Lemma 4.5. Let T and R be as above, then 

\\ RT k+l R *\\ < || T | 
II 1 1 Hi 



\RT R* 



\h 2 ii-"- 

for all k eN. In particular, if \\T\\ H < 1 this proves monotonicity in k. 

Proof. By the assumptions, T has a unique positive square root T, i.e. T = TT, which 
is again self-adjoint, see e.g. [35, Th. 9.4-2]. Using that H^H^,^^ = HAA*]]^ for every 
bounded linear operator A : H 2 — > Hi, we obtain 



\RT k+1 R 



I Hi 



< 



2 



Hi 



RT 
T 



T 



H> 



H 2 -^H 1 

\\RT k R* 



2 

Hi 



RT k+l 

U RT™R* 



2 

H 2 ^H X 
2k 



\T\ 



Hi 



Hi 



I Hi 



Lemma 4.6. In the setting from above let additionally \\R\\h 

i 2 k 



□ 

\RR*\\ Hl < I, then 



\RTR*\\' Hi < 



RT 2 R* 



Hi 



for all k G N. 
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Proof. The case k = is obvious. Now 
\\RTR 



i Hi 



\RTR*\\t 12 < 



< 
< 



RT 

RT 2k R 



Hi 
2 



H 2 ^Hi 



Hi 



suppose the statement is correct for k — 1, then 
RT 2 * 1 R* 
R* u2 



2 

Hi 



\H 1 ^H 2 



RT 



,2 k ~ 1 r r 2 k ~ 1 



R* 



\RR* 



Hi 



\H 2 



□ 



which proves the statement for k. 

The next corollary combines the statements of the last two lemmas to give a result 
similar to Lemma 4.6 for arbitrary exponents. 

Corollary 4.7. Additionally to the general assumptions of this section let \\T\\ H < 1 
and \\RR*\\ Hi < 1. Then 

\\RTR*\\ 2k < \\Rr k R*\\„ 

II II Hi — M \\Hi 

for all fceN. 

Proof. Let I = |~log 2 k] such that k < 2 e < 2k. Since H-RTi?*!!^ < 1 by assumption, we 
obtain 



\RTR 



.* ii2fe 



Hi < \\RTR*f Hi < 



.1 LA.Q 



RT 2 R* 



ifi 



L.4.5 , 

< \\RT k R*\ 



Hi ■ 



□ 



In the following section we will apply these bounds for a specific choice of R and T. 



4.3 The result 

We prove the following theorem, see [67, 68]. 

Theorem 4.8. Let Psw (resp. Psb) be the transition matrix of the Swendsen-Wang 
(resp. single-bond) dynamics for the random- cluster model on a graph with m>3 edges. 
Then 

A(Psb) < A(Psw) < 8mlogm A(Psb). 

This theorem shows that the Swendsen-Wang dynamics is rapidly mixing if and only 
if the single-bond dynamics is rapidly mixing, since the spectral gaps can differ only by 
a polynomial in the number of edges of the graph. In Section 4.4 we will state some 
of the results that follow from the (already discussed) results for the Swendsen-Wang 
dynamics. 

The proof of Theorem 4.8 is based on the bounds of the last section and a suitable 
choice of the involved operators. Recall that we consider both dynamics on a graph 
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G = (V, E) with m edges, i.e. m = \E\. Fix an arbitrary ordering ei, . . . , e m of the edges 
e G E. We set the Hilbert spaces from the last section to 

H x := L 2 (fi) and H 2 : = L 2 {y) 

and define the operators 

T := - Y T e 

i=l 

and 

T := 

i=i 

where the T e , e G are from the common representation of Section 4.1, see (4.3). Recall 
that P S b = MTM* andP S w = MTM* by Lemma 4.3 with M from (4.1). Additionally 
we define 

m 

i=i 

for a G N m . By Lemma 4.2(h) we obtain for a, 7 G N m that 7^ = 7^ if and only if 
{i : cti = 0} = {i : 7^ = 0}. Furthermore, T a — T iff > for alH = 1, . . . , m. 

To conclude the proof of Theorem 4.8 we need two lemmas that are stated in the 
sense of Section 4.2. Afterwards we will see how this implies the result. The first lemma 
follows from Lemma 4.5. 

Lemma 4.9. Let R : H 2 — )■ H\ be a bounded, linear operator. Then 

\\RTR*\\ Hi < \\RT k R*\\ Hi 

for all fceN. 

Proof. See Remark 4.4 for arguments that show positivity of T. Additionally, Lemma 4.2 
leads to HTH^ < ^ Y^ILi ll^ej/^ = 1- Hence T satisfies the assumptions of Lemma 4.5. 
We obtain 

H-Kr^ll < \\RT k R*\\ Hi for every k < £. 

Note that lim^oo T e = T (in norm topology) implies hm 1 1 RT £ R* 1 1 ^ = \\RTR*\\ Hi , 
which implies the result. 

□ 

Lemma 4.9 (or Lemma 4.5) shows that, while k approaches infinity, H-RT^i?*!^ mono- 
tonically approaches \\RTR*\\ Hi . This suggests that, for k large enough, these norms 
are close to each other. The next lemma yields such a reverse inequality. 
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Lemma 4.10. Let k = [mlog™] and R : H2 — > H\ be a bounded, linear operator with 

\RT k R*\\ TJ < (l-e)\\RTR*\\ lJ + e. 



\RR*\\ Hl < 1. Then 



\H 1 - ^ ~> II"' 



Proof. Define the index sets I m ^ '■= {ct G N m : Y^Li a i = ^} an d Imk := i a £ ^m,fc : 
a, > 0, Vi = 1, . . . ,m}. Let := J m>fc \ and denote by (*), for a G 7 TO)fc , the 
multinomial coefficient, i.e. 

'AA fc! 



Obviously (by the multinomial theorem [30, eq. (2.21)]), 



E 



a 



m fc 



and 



e : se e j = e : =-<"■ -i)'- 



We write 



7*= 1 E tJ ^ E (*) r „ 

\ m z — ' / mr z — 4 \a J 

\ i=i / ae/ m , fc v 7 

= tE Hr. + ^E (V- 

ae/ 1 t v 7 ae/° . v 7 

Note that we use for the second equality that the T e 's are commuting by Lemma 4.2(h). 
Since we know that T a = T for every a G 1^ fc (note that 1^ k = for k < m) and 
\\RT a R*\\ Hl < 1 for every a G I TO) fe, we obtain 

s ^ E © ll^*L, + ^ E (*) II^L, 

11^^*11 + Z •»■!>■ 



Using (1 — a)c + a < (1 — b)c + 6 for c < 1 and a < 6, it follows 



i2T*i2* „ < 1-m 1 LRTiT „ + m 1 

» \ m J J 11 UHl \ m 

Setting k = [to log— ] yields the result. □ 
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Now we are able to prove the comparison result for SW and SB dynamics. For 
this let Si(B,(a,A)) : = z/(ct, A) for all B G f2 RC and (o~,A) G fij, which defines an 
operator (by (4.2)) that maps from H 2 to Hi. The adjoint operator S* is then given by 
Sl((a, A), B) := u,(B) and thus, S^A, B) = S„(A, B) = fi(B) for all A, Be Vl KC . For 
the proof we choose the operator 

R := M-St 

with M from (4.1). This operator has some useful properties which can be deduced 
directly from the properties of M and Si. First of all, note that the matrix corresponding 
to SI is constant in the first (FKES) coordinate. This readily implies MS 1 * = S^. But S^ 
defines a self-adjoint operator on Hi := L 2 (fi) and hence, S^ = S 1 * = (MS 1 *)* = SiM*. 
It follows that 

RR* = (M - Si) (AT - SI) = MM* - S„ = I - S„, 

with If := / for all / G L 2 (/i), because MM* (A, B) = t(A = B). Using that (RR*) 2 = 
(I - S^) 2 = I — Sp = RR*, we obtain that R satisfies G {0, 1} and thus the 

assumption of Lemma 4.9 and 4.10. Note that ||i?-R*|| Hi = 1 whenever m > 1, since then 
there always exists a non-constant function / G Hi with (/, 1) M = 0, i.e. (/ — S^)f = f. 
Additionally, we get from Lemma 4.3 that 

-Psb — S^ = RTR* 
and _ 

-Psw — S^ = RTR*. 

We finish the section with the conclusion of the proof. 
Proof of Theorem 4-8. Recall that by definition 

A(P S b) = 1 - \\Psb - S,\\ Ri = 1 - \\RTR*\\ Hi 

and 

-M-^sw) = 1 — ||-Psw — ^mIIj^ = 1 — \\R'7~R*\\ Hi . 

Obviously, Lemma 4.9 (with k = 1) implies the first inequality of Theorem 4.8. 
Now let k = |~m log — ] . Since we know from Lemma 4.10 that 



\\RTR*\\ Hl > ^{WRT'R^-e 



we obtain 



a(p sw ) = i-\\rtr*\\ Hi L '<° i-j^(\\Rr k Rr\\ Hi -e 

1 / ,, , \ Coro. 4.7 I 

1- RT h R* „) < [l-WRTR* 



\2k 



l-e\ 11 " H J - l-e\ 11 UHl 

< ^ £ (l-\\RTR*\\ Hl ) = ^- £ X(Psb), 
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where the last inequality comes from 1 — x k < k(l — x) for x G [0, 1]. Setting e = |, we 
obtain = Ak < 8m log m. This proves the statement. 

□ 



4.4 Applications 

We present three applications of Theorem 4.8. The first one is a result on the Swendsen- 
Wang dynamics on graphs with bounded linear-width that is based on a result of Ge and 
Stefankovic [25], see Theorem 2.19. For this recall the definition of the linear-width of a 
graph G = (V, E) as the smallest number t such that there exists an ordering ei, . . . , e\E\ 
of the edges with the property that for every % G [\E\] there are at most I vertices that 
have an adjacent edge in {ei, . . . ej} and in {e^+i, . . . £\e\\- 

Corollary 4.11. Let P^w be the transition matrix of the Swendsen-Wang dynamics for 
the random-cluster model with parameters p and q G N on a graph G = (V, E) with 
linear-width bounded by t. Let m := \E\. Then 

A(Psw) > ^ — 2 - (4.4) 
2 q t+i m z 

Note that Lemma 2.6 shows that the result holds true if we replace the random-cluster 
model by the corresponding Potts model at inverse temperature f3 = — log(l — p) and 
-Psw by -Psw- 

Proof. The result follows from Theorem 4.8 together with Theorem 2.19. 

□ 

Now we turn to the single-bond dynamics. The second result of this section is, anal- 
ogously to Theorem 2.10 and 3.4, a result on rapid mixing on Z| = (Vl,2, El^), i.e. on 
the two-dimensional square lattice of side length L, at high temperatures. 

Corollary 4.12. Let P^b be the transition matrix of the single-bond dynamics for the 
RC model on 1? L with parameters p and q > 1. Let m = 2L(L — 1) = \El^\ ■ Then there 
exist constants c p = c p (q), d > and C < oo such that 

~ c 

A(P S b) > — tt forp<p c (q) 

m A log m 

and 



A(Psb) > c' m for q = 2 and p = p c (2), 



where p c (q) = j^=. 
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Proof. The bounds for the Swendsen-Wang (SW) dynamics for the Potts model from 
Theorem 3.4 and Lemma 2.6 show that SW is rapidly mixing in the desired range of p. 
Theorem 4.8 leads to the specific bounds (since |Vl )2 | < \El,2\ = 2L(L — 1) < 2 |Vl )2 |). 

□ 

In the next chapter we present techniques that allow us to relate the spectral gap of 
local Markov chains for the random-cluster model at high and low temperatures. In par- 
ticular, we will extend the first bound of Corollary 4.12 to all non-critical temperatures, 
see Theorem 5.10. 

The third and last result that we want to present here yields tight bounds on the 
spectral gap of the single-bond dynamics on the d- dimensional torus 1L d L of side length 
L, see (2.32), at the critical temperature. These bounds show slow mixing if q is large 
enough. This complements (and also uses) a result of Borgs, Chayes and Tetali [5], see 
Theorem 2.12. 

Theorem 4.13. Let Psb (resp. Psw) be the transition matrix of the single-bond (resp. 
Swendsen-Wang) dynamics for the random- cluster model on % d L , d > 2, with parameters 
p and q. Then there exist constants k\, k 2 < oo and a constant k 3 > (depending all on 
d, (3 and q) such that, for q and L large enough, 

e -(*i+fa/^ d - 1 < A(P SB ) < e"^" 1 

and 

if p = 1 — e~ /3c<yd,q ^ with (3 c (d,q) from (2.30). The lower bounds hold for allp, q and L. 

Proof. We obtain the second inequality immediately from Theorem 2.12 and Lemma 2.6. 
The result on the single-bond dynamics thus follows Theorem 4.8. 

□ 

The original proof of the lower bound on A(Psb), as given in [68], uses the bound of 
Theorem 2.19 together with a bound on the linear-width of r L d L . 

Remark 4.14. There are some more results on rapid mixing of the single-bond dynamics 
that follow directly from Theorem 4-8. These include rapid mixing on trees and cycles 
(cf. Theorem 2.17), on the complete graph (cf. Theorem 2.16) and on graphs with bounded 
degree if p is small enough (cf. Theorem 3.3). We do not state them here explicitly, 
because our main purpose is to prove rapid mixing of Swendsen-Wang dynamics. 
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This chapter is devoted to the study of the spectral gaps of Swendsen-Wang and single- 
bond dynamics if the underlying graph for the random-cluster model has a special struc- 
ture. Namely, we consider planar graphs, i.e. graphs that can be drawn in the plane 
without intersecting edges. For each such graph it is possible to define a corresponding 
dual graph and we will see that it is possible to prove bounds on the spectral gap (of 
both dynamics) on the original graph in terms of the spectral gap on the dual one, if 
the temperature is suitably changed. Especially, we prove results on rapid mixing of 
the dynamics at low temperatures, where the previous techniques, which rely on rapid 
mixing of single-spin dynamics for the Potts model at the same temperature, do not 
apply. 

The plan of this chapter is as follows. In Section 5.1 we introduce the notion of 
representations of graphs in the plane and, consequently, the notion of planar graphs. 
This leads to the construction of dual graphs and we will see that there is a tight 
connection between the random-cluster model on a graph and its dual. In Section 5.2 
we use this connection to prove the main results of this chapter, i.e. the comparison of 
the spectral gaps on a graph and on its dual graph. This ends in a proof of rapid mixing 
of the considered dynamics on the two-dimensional square lattice at all non-critical 
temperatures, see Section 5.3. Finally, in Section 5.4, we present a slight generalization 
of the given results to graphs that can be drawn on a surface with bounded genus. 

5.1 Planar and dual graphs 

The following introduction to embeddings of graphs in the plane and to the construction 
of dual graphs is presented according to Mohar and Thomassen [52] and we refer to this 
monograph for a more comprehensive study of the topic. 

Throughout this chapter we consider only connected graphs. Let a connected graph 
G = (V, E, ip) be given and recall that <p is the function that assigns to each edge e G E 
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the set of its endvertices. For convenience, we will mostly omit tp from the notation. We 
say that G is a planar graph if there is a representation of G in the plane M 2 . That is, 
there exists a finite set V C M 2 , a bijection k from V to V and a set of simple curves 
E = { 7e : [0, 1] — > IR 2 : e G E} in R 2 such that 

• 7 e (0) G V" and 7 e (l) G V" for all e G E, 

• for all u,v V and e G E we have 

{7e(0), 7 e (l) } = {«(«), «(«)} <^(e) = {u,w} 

and 

• 7e ((0,l))n 7 /([0,l]) =0for aUe^feE. 

Note that a curve 7 : [0, 1] — > M 2 is always assumed to be continuous and is called 
simple if it does not cross itself except for its ends, i.e. 7(2) = 7(2/) for x ^ y implies 
x,y G {0, 1}. The above assumptions mean that the ends of all curves are elements of 
V, 7 e connects the elements of V that correspond (by k) to the endvertices of e G E and 
that the interior of each curve is disjoint from all other curves, respectively. The tuple 
G = (V, E) is called a representation of G in the plane and we assume henceforth that 
we fix for every planar graph G a representation in the plane G. In the literature G is 
also called plane graph, planar embedding or drawing of G. See e.g. Figure 2.1 for two 
examples of planar graphs (or, more precisely, their representations in the plane). 

Clearly, being connected by a simple curve in a set C C M 2 defines an equivalence 
relation on C, whose equivalence classes are called regions of C. A face of C is a region 
of M 2 \ C and we call the faces of Uees 7e([0j !])> i- e - ^ e f aces of the union of the images 
of all curves of G, the faces of G. Note that since we consider finite graphs, there is 
exactly one unbounded face. We call it the outer face of G. In the following we need the 
number of faces of a representation G of a graph G, which we denote by F(G). One may 
think that this number depends on the choice of the representation, but the following 
lemma, which is known as Euler's (polyhedral) formula, shows that this is indeed not 
the case, see e.g. [52, Prop. 2.2.3]. 

Lemma 5.1 (Euler's formula). Let G = (V, E) be a connected planar graph and G be a 
representation of G in the plane, then 

2 = \V\ - \E\+F(G). 

This proves that the number of faces does not depend on the particular representation 
and thus we write F(G) for the number of faces of any representation of G. 

Additionally to the statement of Lemma 5.1 for the whole graph G, we need a result 
on spanning subgraphs of it. For this recall that we denote by Ga = (V,A), A C E, 
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the graph G without the edges of E \ A and that c(Ga) (or simply c(A)) denotes the 
number of connected components of Ga- The representation of Ga in the plane is fixed 
to be the representation of G without the curves corresponding to the edges of E \ A 
and we write F(A) for the number of faces of the (fixed) representation of Ga- The next 
equation is a simple corollary of Lemma 5.1. 

Corollary 5.2. Let G be a planar graph and Ga, A C E, be a spanning subgraph of G, 
then 

c(A) + l = \V\-\A\+F(A). 

Proof. We know from Lemma 5.1 that for each connected component Gi = (Vi,Ei) 
of Ga, i = 1, • • • ,c(A), we have 2 = \Vi\ — \Ei\ + F(Gi). Summing over i leads to 
2c(A) = \V\ — \A\ + F(Gi), but in the last sum we count the outer face c(A) 

times. Subtracting c(A) — 1 on both sides yields the result. 

□ 

Now we turn to the definition of the dual graph of a graph G. For this consider the 
(fixed) representation in the plane G — (V,E) of the connected graph G = (V,E). We 
define the dual graph as the (unique) graph = {V\E^) such that its representation 
in the plane G* = (V',E') satisfies that 

• there is exactly one E in every face of G and 

• for every e E E there is exactly one simple curve 7J E E< with endpoints in 
V"t that intersects 7 e E E, but not 7/, / 7^ e E E. 

Note that, for every j e E E, the endpoints of the corresponding "dual" curve 7^ are 
unique, since there is only one vertex in every face of G and a curve can separate at 
most to faces from each other (by Jordan's Curve Theorem [52]). We call the edge 
ej E E^ in the dual graph that belongs to the dual curve 7! the dual edge of e E E. 

In general, the definition of the dual graph of G depends on the representation of G in 
the plane. Therefore recall that we fix such a representation whenever we fix the graph. 
This implies that the dual graph is well-defined, unique and that the dual of the dual 
graph is the original graph. (Here we presume that the representation in the plane of 
the dual graph is fixed to be the one constructed above.) 

Remark 5.3. Note that the dual graph of any planar graph is connected. Therefore 
the connectivity of G is necessary to obtain that the dual of the dual graph of G is G. 
Additionally, it is known, see [52, Thm. 2.6.7], that if we restrict to 3- connected planar 
graphs, i.e. planar graphs with at least 4 vertices such that every subgraph obtained by 
deleting two vertices is still connected, then the dual graph is unique and thus independent 
of the representation. 
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In view of the application to Markov chains for the random-cluster model we also 
define a dual configuration A> C E* on the dual graph G* for every random-cluster 
configuration A C E on G by the property that for all e £ E (and corresponding 
e t £ £t) 

e t £A f <^> e^A (5.1) 

Thus, a dual edge is present in whenever the original one is absent in A. In particular, 
\E\ = \EM = \A\ + lAH for every A C E. See Figure 5.1 for an example of a graph and 
its dual together with a proper pair of RC configurations. 
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Figure 5.1: Left: The graph Z3 (dots and solid lines) and its dual graph Z3 (crosses and 
dashed lines). Right: A RC configuration on Z3 and its dual configuration. 



Recall that Ga, A C E, is the representation of the spanning subgraph = (V, A) 
in the plane that is adopted from G by forgetting the curves corresponding to E \ A and 
c(A) is the number of connected components of Ga- With a slight abuse of notation we 
write c(A^), A^ C E\ for the number of connected components of G Ai . 

It is easy to verify, cf. Figure 5.1, that every face of Ga contains a unique connected 
component of £rt t and vice versa. Thus, we have c(A^) = F(A). Using Euler's formula, 
see Corollary 5.2, we obtain 



c(A) + 1 



c(A*) + \V\ 
c(A ] ) + \V\ 



\A\ 

\E\ + U f | 



(5.2) 



This equality leads immediately to the following lemma that shows the tight connection 
between the random-cluster measure on G and G* , see e.g. Grimmett [29, eq. (6.4)]. 
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Lemma 5.4. Let ^ be the random- cluster measure on a planar graph G = (V, E) with 
parameters p and q. Additionally, let be the dual graph of G. Then 

/4(A) = for all AcE, 

where p* satisfies 



p* q(l — p) 



1 — p* p 



(5.3) 



Proof. Recall from (2.11) that 



where Z[G,p,q) = 2^ j 4cb(i^)' j4 ' * s ^ e normalization constant. With (5.2) and 
(5.3) we obtain 

/ \ \A\ 

P ) q c(A) (5j) / P \ qC{A t) + \V\-\E\ + \At\-l 
l-p) \1-P) 



\A\ 



(5^3) f q(l -p ) V 1 1 c(A t) + | V |_ [E[+|At| -1 



p* j 

f 1 " 1 T^T g C(At) - 



1 — p* 

Thus, with Z(Gt,p*,g) = g 1 -' y l(p7(l -p*)) m Z{G,p,q) we get //g g (A) = /$,(At) as 
desired. 

□ 

The relation from (5.3) has a unique self-dual point, i.e. a value of p such that p = p*. 
This value is given by 

*-(,) == T^=. (5-4) 

Note that p s d(l) equals 1 — e^ c ^ q \ where (3 c (q) is the critical inverse temperature for the 
g-state Potts model on the two-dimensional square lattice, see Remark 2.9. 



5.2 Dynamics on planar graphs 

We prove that the spectral gap of the Swendsen-Wang and the single-bond dynamics for 
the random-cluster model on a planar graph G is bounded from above and from below 
by the spectral gap of the corresponding dynamics on the dual graph if we change 
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the temperature parameter p to p* from the last section. Furthermore, we state a second 
corollary of Theorem 2.13 at the end of this section, that shows rapid mixing of Swendsen- 
Wang dynamics for the Potts model on planar graphs if the inverse temperature is 
small or large enough (depending on the maximum degree). This is a modification of 
Corollary 3.3 to planar graphs, cf. Theorem 2.13. 

For this fix a connected planar graph G = (V, E) (together with its representation in 
the plane), some p e (0, 1) and a natural number q. Let be the dual graph of G and p* 
be the unique value satisfying (5.3), i.e. p* = We call the random-cluster model 

on with parameters p* and q the dual model and abbreviate p^* by p) . Additionally, 
given a Markov chain for the random-cluster model with transition matrix P, we write 
pr for the transition matrix of the Markov chain for the corresponding dual model. 

The first result demonstrates the usefulness of the heat-bath dynamics for the random- 
cluster model, which is one of the two previously defined local Markov chains for this 
model, see (2.24) for the definition of its transition matrix Phb- It turns out that the 
spectral gap of Phb equals the spectral gap of P RB - This is a simple consequence of 
Lemma 5.4 and was probably known before, but we could not find a reference for it. 

Lemma 5.5. Let Pub (resp. P HB ) be the transition matrix of the heat-bath dynamics for 
the random- cluster model on a planar graph G (resp. for the dual model). Then 

A(Phb) = A(J§b). 



Proof. By definition we have for all A, B C E that 

PMAB) = ^Y i ll(A ^l\ (AV) t(B\e^A\ e) , 

see (2.24). Using Lemma 5.4 and that (A U e) f = A ] \ e t (resp. (A \ e) f = A ] U e t ), by 
equation (5.1), we obtain 

ft=(Ai» = ^J2 KAu ^l {AV) MB\e = A\e) 

= pS ^pu4Spw 1((BNe)t=(Me)t) 

Note for the last equality that U ej- = A^ U e-f if and only if B^ \ e^ — A^ \ e^. This 
obviously implies that the matrices Phb arid P HB have the same eigenvalues and thus the 
same spectral gap. 
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We immediately obtain the last ingredient for the proof of rapid mixing of Swendsen- 
Wang and single-bond dynamics on the two-dimensional square lattice at all non-critical 
temperatures, see Section 5.3. 

Theorem 5.6. Let Pg-w (resp. Psb) be the transition matrix of the Swendsen-Wang 
(resp. single-bond) dynamics for the random- cluster model on a planar graph G with m 
edges and let Pg W (resp. P$ B ) be the SW (resp. SB) dynamics for the dual model. Then 

A(Psb) < qX(PL) 

and 

A(Psw) < 8qm\ogm\(?£ w ). 



Proof. With Lemma 2.7 and Lemma 5.5 we get 



A(P S b) < A(P HB ) = X(PL) < 1 -P* 1 " - 



Q 



SBJ 



The constant on the right hand side is maximal for p* = 1. This proves the first 
statement of the theorem. For the second we additionally use Theorem 4.8. Thus, 

A(Psw) < 8mlogmA(PsB) < 8gmlogm A(Pg B ) < 8gmlogm A(Pg W ). 

□ 

Prior to the application of this result to the two-dimensional square lattice in the next 
section we present the above mentioned result on rapid mixing of the Swendsen-Wang 
dynamics at sufficiently high and low temperatures if the underlying graph is planar and 
has bounded maximum degree. This corollary relies on a result of Hayes [31]. 

Corollary 5.7. The Swendsen-Wang dynamics for the random-cluster model with pa- 
rameters p and q on a planar, simple and connected graph G with m edges and maximum 
degree A > 6 satisfies 

AOPsw) > ^— 4 if V < - JL^r , 

m 3V A - 3 

and 

A(psw) > c( !r £) , if p > i - ; 



m? log m ' ' gAt ' 

for some c = c(A,p, q) > and e > 0, where A^ is the maximum degree of a dual graph 
ofG. 
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Proof. From Theorem 2.13 together with the bound on the operator norm of the adja- 
cency matrix of planar simple graphs from [31, Cor. 17] we obtain A(Phb) > (1 — ^)/ n i 
where Phb is the transition matrix of the heat-bath dynamics for the Potts model on G 
and n is the number of vertices, if /3 < e/ a/3(A — 3). Recall that p = 1 — e - ^, which 
implies < p/(l — p), we obtain that the assumption on p yield the desired bound on 
f3. By connectedness of G we have m > n — 1. Thus, Theorem 3.1 completes the proof 
of the first inequality. 

For the second we use A(Psw) > l/(8gmlogm) A(Pg W ) from Theorem 5.6, where Pg W is 
the Swendsen-Wang dynamics for the dual model, i.e. the RC model on the dual graph 
G^ with parameters p* = and q > 1. Since p* < q(l — p), it is enough to prove 

KHw) ^ c{l-e)/m if p* < e/Al But, using /3 < p/(l — p) and m > n — 1, this follows 
from Corollary 3.3 and Lemma 2.6. 

□ 

Note that the maximum degree of the dual graph corresponds to the maximum number 
of edges that are needed to surround a single face in a planar representation of the original 
graph. This quantity is sometimes called maximum face-degree of G in the literature. 

Unfortunately, in the second inequality of Corollary 5.7 the lower bound on p contains 
and not its square root. The reason for this is that the proof of the bound on the 
principal eigenvalue of a planar graph needs that the graph is simple, see [31, Coro. 17]. 
(Note that although Euler's formula is valid also for non-simple graphs, one cannot 
deduce from it that there is a vertex of small degree.) But since the proof of Corollary 5.7 
uses dual graphs, and we cannot guarantee in general that the dual graph is simple, we 
have to use the stronger assumption. If we would consider only 3-connected graphs, 
see Remark 5.3, then it is known that the dual graph is simple. In this case one could 
improve the lower bound on p to 1 — ej (3qVAt — 3) under the additional assumption 
At > 6. 



5.3 Rapid mixing on the square lattice 

The goal of this section is to present an application of Theorem 5.6 to the random-cluster 
model on the two-dimensional square lattice. We prove that Swendsen-Wang and single- 
bond dynamics are rapidly mixing for each q e N if the parameter p satisfies p ^ 
Translated to the g-state Potts model this shows that the Swendsen-Wang dynamics is 
rapidly mixing at all non-critical temperatures, i.e. at all (3 ^ (3 c (q), see Remark 2.9. 

This is done by a successive application of the results of the previous chapters. Espe- 
cially, we need Theorem 3.4, Corollary 3.10, Theorem 4.8 and Theorem 5.6. Note that, 
since we proved only comparison results between different Markov chains, the results 
on rapid mixing rely ultimately on the known results on the mixing properties of the 
heat-bath dynamics for the Potts model, see Section 2.5. 
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First of all, recall the definition of Z| = (Vl,2, El^), i.e. the two-dimensional square 
lattice of side length L, from Section 2.5 and of from Section 3.2. It is easy to see 
that T?2 is indeed the dual graph of Z|, see Figure 5.1. Hence, we obtain the following 
result. 

Theorem 5.8. Let Psw be the transition matrix of the S wends en-Wang dynamics for 
the random- cluster model on Z| with parameters p and q. Let m = 2L(L — 1) = |Pl,2|. 
Then there exist constants c p = c p (q), d > and C < oo such that 
c 

• A(Psw) > — forp<p c (q), 

m 

~ c 

• A(Pgw) > -j7- forp>p c (q), 

m z log m 

• A(Psw) > c'm~ c for q = 2 and p = p c (2) , 
where p c (q) = j^=. 



Proof. The first and the last inequality follow from Theorem 3.4, Lemma 2.6 and the 
fact that \Vl,2 \ < m = 2L(L — 1) < 2 \ Vl,2\- For the second let Pg W be the Swendsen- 
Wang dynamics on Z^ with parameters p* = and q. Corollary 3.10 (together 

with Lemma 2.6) implies that A(Pg W ) > c p */m for some c p * < oo, if p* < p c (q). (Recall 
that p c (q) = 1 — e~P°( q \) But since p* < p c (q) if and only if p > p c (q) we can deduce the 
desired bound from Theorem 5.6. 

□ 

An immediate consequence is the following corollary. 

Theorem 5.9. Let Pgw be the transition matrix of the Swendsen-Wang dynamics for 
the q-state Potts model on 7? L at inverse temperature (3. Let n = L 2 . Then there exist 
constants eg = c@(q), d > and C < oo such that 

• A(Pgw) > - for/3<f3 c (q), 

n 

• A(Psw) > forP>/3 c {q), 

n z log n 

• A(P SW ) > c'n~ c forq = 2 and (3 = /3 C (2), 
where c (q) = log(l + y/q). 



Proof. Apply Lemma 2.6 to Theorem 5.8 and note, again, that n < \El$\ < 2n. 

□ 
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This seems to be the first proof of rapid mixing of a classical Markov chain for the 
two-dimensional Ising model (q = 2) at all temperatures. In fact, in [66] a somehow 
artificial Markov chain, that makes an additional step on the dual graph, is proven to be 
rapidly mixing. However, the bounds as given in Theorem 5.9 are certainly not optimal. 
We conjecture A(Psw) to be bounded below by a constant for (3 ^ c (q)- 

We also obtain the following for the single-bond dynamics. 

Theorem 5.10. Let Psb be the transition matrix of the single-bond dynamics for the 
RC model on 1? L with parameters p and q. Let m = 2L(L — 1) = I-El^I- Then there 
exist constants c p = c p (q), d > and C < oo such that 

KPsb) > -^TZ: forp^Pc(q), 



where p c (q) 



m? log m 

-M-Psb) — c ' m ~ C f or Q — 2 and p = p c (2), 



Proof. The result for p < p c {q) was already given in Corollary 4.12. For the result for 
p > p c {q) take the bound of Corollary 3.10 and apply Lemma 2.6 and Theorem 4.8. □ 



5.4 Graphs of higher genus 

In this final section we give a brief description of embeddings of graphs into surfaces, 
i.e. into two-dimensional topological manifolds, and show that it is possible to extend 
the results of Section 5.2 to this case. For convenience, we consider only connected, 
orientable and closed surfaces, which we simply call surfaces in the sequel. The plan 
of this section is to define the genus of a surface (or a graph) and then, analogously to 
Section 5.1, to define representations of graphs into surfaces. Then we give a formulation 
of Euler's formula, define dual graphs and show the consequence for the random-cluster 
measure. 

For this, let Ti and r 2 be two surfaces and define their connected sum T\ U r 2 as 
the surface that is obtained by cutting out a small disc from each of the surfaces Ti 
and T 2 and gluing them together along the boundaries of the resulting holes. (See 
e.g. Giblin [26] for a more formal definition.) It is known that the surface Ti U r 2 does 
not depend (up to homeomorphisms) on the choice of the discs that are cut out from the 
primal surfaces, see [26, Prop. 2.17]. We consider the surfaces S^, h > 0, which are given 
by Sfc+i := §fc U §!, where §o is the two-dimensional sphere and Si is the torus. That 
is, is the surface that can be obtained from a sphere by adding h handles to it. We 
call E>h the (orientable) surface of genus h. For example, the sphere is a surface of genus 
and the torus a surface of genus 1. Note that each connected, closed and orientable 
surface is homeomorphic to precisely one of the surfaces S^, h > 0, see [52, Thm. 3.1.3]. 
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Let G = (V, E) be a connected graph. In the light of Section 5.1 we say that G has a 
representation in S^, namely G — (V,E), if there exists a finite set V C a bijection 
K from V to V and a set of simple, continuous curves E = {~y e : [0, 1] — > S/j : e G i?} 
in §/j with the same three properties as given in the beginning of Section 5.1. Here, we 
consider planar graphs as graphs that admit a representation in the sphere §o- 

The genus g(G) of the graph G is defined to be the smallest integer h such that 
G has a representation in §^ and we call a representation of G in § g (G) a minimum 
genus representation. For example, the graph Z|, L > 3, i.e. the two-dimensional torus, 
see (2.32), has genus 1. 

Remark 5.11. Every finite graph can be represented in §^ if h is sufficiently large. For 
this consider the complete graph K n , whose genus satisfies g{K n ) = \(n — 3)(n — 4)/12] , 
see [57], and observe that every graph on n vertices can be represented in the same surface 
than K n (since they are subgraphs). Below we will see that there are constants involved 
in the results that depend exponentially on the genus. Therefore, the forthcoming results 
are only useful in cases where the genus is at most logarithmic in the size of the graph. 

Recall that F(G) denotes the number of faces of the representation G. It turns out 
that there is a version of Euler's formula also in this case, see [52, eq. (3.7)]. 

Lemma 5.12 (Euler's formula II). Let G = {V,E) be a connected graph and G be a 
minimum genus representation of G , then 

2-2g(G) = \V\ - \E\+F(G). 

Following the ideas of Section 5.1 we fix a minimum genus representation G of G and 
fix the representation of the spanning subgraphs Ga — (V, A), A C E, to be G without 
the curves corresponding to E \ A. We write F(A) for the number of faces in this 
representation of Ga- Note that a subgraph could be of smaller genus, e.g. G = (V, 0) 
is obviously planar. However, we equip subgraphs of G with a representation in § g (G)- 
We obtain the following corollary, which is the basis of the remaining results. 

Corollary 5.13. Let G be a connected graph and Ga, A G E, be a spanning subgraph 
of G, then 

2-2g(G) < \V\-\A\+ F(A) -c(A) + 1 < 2. 

Proof. We prove that the term in the middle cannot increase if we replace A by AUe, i.e. 
\V\-\AUe\ + F(AUe)-c(AUe) + l < \V\-\A\ + F(A)-c(A) + l. This proves the claim 
since the first and second inequality of the statement become an equality for A = E and 
A = 0, respectively. Thus, it is enough to prove F(A U e) — F(A) < c(A U e) — c(A) + 1 
for all A C E. The right hand side of this inequality equals 1 if the endpoints of e are 
connected in (V,A), i.e. e^He^, and otherwise. Consequently, we want to verify 
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F(A U e) - F(A) < l(e« 4 e^), that is to say that if adding e to A induces a new 
face, then the endpoints of e are connected in Ga- But this is clearly true, because if a 
new face is induced by e, then e is contained in a "cycle" in GAue, which implies that 
the endpoints are already connected in Ga- 

□ 

Given a graph G = (V, E) and a corresponding minimum genus representation G = 
(V,E) we define the dual graph G^ = {V\E^) just as in Section 5.1. That is, is the 
unique graph that has a representation in S g (c) with a single vertex in every face of G 
and, for every curve 7 e G E, a curve that intersects only 7 e and connects the vertices 
(or the vertex) on both sides of j e . We write e\ for the edge that corresponds to this 
"dual" curve. 

A dual configuration A^ C of A C J5 is given by 

e t G A f e ^ A, 

cf. (5.1). It is again easy to convince oneself that every face of the (from G adopted) 
representation of Ga contains a connected component of G .^, i.e. F(A) = c(A^). Thus, 
we obtain 

c(A ] ) + \A ] \ - 1 < c(A) - \V\ + \E\ < c(A ] ) + \A ] \ - 1 + 2g(G). (5.5) 

This implies an analogous result to Lemma 5.4 and, eventually, the following lemma. 
Recall that the dual model to the random-cluster model on G with parameters p and q 
is the RC model on G^ with parameters p* and q, where p* = p ^M^ p \ ■ 

Theorem 5.14. Let Phb? -Psb and Psw (resp. P BB , P$ B and Ps W ) be the transition 
matrices of the heat-bath, single-bond and Swendsen-Wang dynamics for the random- 
cluster model on a graph G (resp. for the dual model). Then 

A(Phb) < g 8s(G) X(PL) 

and hence, 

A(Psb) < g 8s(G)+1 X(PL) 

and 

A(Psw) < 8q s ^ +1 m\ogm\(P sw ). 

Proof. We prove only the first inequality. The other two follow in the same way as 
Theorem 5.6 follows from Lemma 5.5. An easy computation (similar to the proof of 
Lemma 5.5) shows 
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for all Ad E. By the definition ofPnB, see (2.24), this implies 

Phb(A,B) < q^PU^,&). 
But these two inequalities yield A(Prb) < g 8g ^A(Pg B ), see Lemma 2.5. 

□ 

Theorem 5.14 can be used in the same way as Theorem 5.6 to prove results on the 
mixing properties of the involved Markov chains in specific cases. For example, a result, 
similar to Corollary 5.7, on graphs of bounded degree can be proven by using Theo- 
rem 2.13 and a bound on the principle eigenvalue of graphs with bounded genus, see 
e.g. [20]. Another application is that lower bounds on the spectral gap of the Markov 
chains for the Potts model on the two-dimensional square lattice with periodic boundary 
condition (i.e. on the two-dimensional torus Z|) at high temperatures can be translated 
to lower bounds at low temperatures, cf. Section 5.3. We omit the details. 
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